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ABSTRACT 


The  Stylist  is  a  Waterloo  Pascal  program  that  analyzes  the  style  of  English  prose. 
A  "style  checker".  The  Stylist  pertains  to  Computational  Stylistics  and  Computer 
Assisted  Composition  Instruction  (CACI).  The  Stylist  creates  an  affective  model  of 
the  text  based  upon  the  following  characteristics  of  its  component  words  :  etymology, 
tangibility,  difficulty,  emotional  connotation  and  vigor.  The  Stylist  then  compares  this 
model  to  the  standards  of  fiction  or  nonfiction  texts  and  reports  results  and 
recommendations  to  the  user. 

The  Stylist  also  creates  a  concordance  of  the  user's  input  text  using  a  new  data 
structure  called  a  Concordance  Search  Tree  (CST).  A  CST  is  a  binary  search  tree  with 
a  linked  list  threaded  through  it  recording  the  order  of  the  use  of  each  word.  An 
inorder  traversal  of  the  tree,  with  a  traversal  of  the  linked  list  during  each  visit,  creates 
a  concordance. 

This  thesis  also  reviews  related  literature  and  programs. 
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I.  INTRODUCTION 

A.       BACKGROUND 

1 .  Computational  Stylistics 

Computational  Stylistics  is  the  computer-assisted  study  of  literary  style. 
Computational  Linguistics  is  the  computer-assisted  study  of  language  itself.  These 
disciplines  are  the  automated  subsets  of  Statistical  (or  Quantitative)  Stylistics  and 
Linguistics,  which  are  the  mathematical  studies  of  style  and  language.  All  of  these 
disciplines  involve  quantifying  aspects  of  language  and  then  manipulating  these 
quantities  in  an  attempt  to  gain  insight  into  how  and  why  the  language  works. 

The  history  of  stylistics  can  be  traced  back  to  1851,  when  Augustus  de 
Morgan  suggested  that  word-length  could  prove  to  be  a  distinctive  trait  of  a  writer's 
style.  [Ref.  1]  This  suggestion  prompted  T.C.  Mendenhall,  an  American  geophysicist, 
to  investigate  whether  mean  word-length  could  resolve  authorship  problems  such  as 
those  posed  by  some  of  the  disputed  Shakespeare  plays  or  the  letters  written  under  the 
pen  name  of  Junius.  Working  in  the  late  19th  century,  Mendenhall  analyzed  the  word- 
lengths  of  some  two  million  words  from  various  periods  of  English  literature,  using  a 
primitive  tabulating  device  that  spit  out  reels  of  paper.  His  results,  however,  proved 
little. 

In  the  early  20th  century',  another  possible  characteristic  of  style,  the 
frequency  distribution  of  words,  came  under  wide-spread  investigation.  G.K.  Zipf 
postulated  a  "Rank-Frequency  Law",  by  which  a  ranking  of  the  use  of  words  in  a  text 
would  show  a  constant  decrease  from  the  most-used  word  down  to  the  least-used  word. 
[Ref.  2]  Other  scholars,  such  as  G.  U.  Yule,  investigated  such  aspects  as  the  richness  of 
a  writer's  vocabulary  and  the  length  of  sentences.  [Ref.  3:  pp.  363  -  390]  By  the  1950s 
and  1960s,  the  application  of  statistical  methods  to  the  study  of  literature  had  reached 
new  heights  of  sophistication  and  complexity.  Some  scholars  were  investigating  the 
significance  of  the  ratio  of  verbs  to  adjectives  and  others  were  applying  rigorous 
statistical  techniques.  Despite  the  many  fascinating  insights  offered  by  these  lines  of 
inquiry,  their  final  significance  and  scientific  credibility  remained  a  question  of  much 
dispute.  Enemies  of  the  new  disciplines  included  Norm  Chomsky,  the  pioneer  of 
formal  languages,  who  argued  that  writing  is  a  very  human  activity  which  involves  a 
great  deal  of  chance,  and  as  such  is  not  easily  quantified.   [Ref.  4] 


Computational  stylistics  began  as  a  natural  outgrowth  of  statistical  stylistics. 
With  the  spread  of  computer  access  some  thirty  years  ago,  some  scholars  quickly 
realized  the  potential  for  their  use  in  literary  studies: 

"I  first  heard  of  computers  in  1955  when  my  wife  (a  mathematician)  told  me  that 
she  was  going  to  work  for  an  oil  company  as  a  program  analyst  ...  It  took  me  a 
while  to  understand  her  flowcharts  and  computer  programs,  but  when  I  did,  I 
realized  that  a  computer  could  be  used  to  solve  other  problems  as  well.  Since 
then,  I  have  used  a  computer  for  numerous  applications  relating  to  my  work  in 
the  English  Department  of  Cleveland  State  University,  and  have  taught  many 
others  .  .  .  the  power  of  electronic  data  processing  in  the  study  of  language  and 
literature."   [Ref.  5] 

The  first  challenge  to  the  field  of  Computational  Stylistics  was  to  translate 
literary  texts  to  electronic  data.  Much  of  the  early  literature  of  the  field  is  devoted  to 
this  basic  problem.  After  the  first  two  decades,  corpora,  or  bodies  of  literary  texts  in 
machine  readable  form,  had  been  developed.  The  Lancaster-Oslo; Bergen  (LOB) 
Corpus  is  a  structured  collection  of  500  two-thousand-word  texts  of  written  British 
English.  An  American  version  of  this  effort  is  the  Brown  Corpus,  constructed  by 
Brown  University.1 

In  addition,  many  literary  classics  have  been  painstakingly  entered  by 
keyboard.  They  are  available  from  the  Oxford  Computing  Laboratory  at  Oxford 
University  and  the  American  Philological  Association.  [Ref.  5]  With  the  increasing 
capability  and  decreasing  cost  of  optical  readers,  the  problem  of  translating  literary  text 
to  electronic  data  should  become  trivial. 

The  usefulness  of  computers  to  statistical  stylistics  quickly  became 
indisputable.  Scholars  used  to  labor  to  copy  down  on  3"  X  5"  cards  each  occurrence  of 
every  noun  used  in  a  Shakespeare  play  and  then  sort  these  cards  by  hand.  [Ref.  6:  pp. 
33  -  50]  Now  a  computer  program  could  manipulate  the  text  in  any  number  of  ways 
within  minutes.  One  of  the  other  prominent  successes  o^  the  use  of  computers  in 
literary  studies  was  the  automation  of  the  process  of  forming  concordances.  A 
concordance  is  an  alphabetical  listing  of  all  the  significant  words  used  in  a  text, 
together  with  the  enclosing  phrase.  Before  the  advent  of  computers,  scholars 
laboriously  built  concordances  of  the  Bible  and  the  plays  of  Shakespeare.  The  power 
of  the  computer  made  possible  giant  concordance-building  projects  such  as  one  for  the 


Electronic  forms  of  both  of  these  corpora  are  available  from:  The  Norwegian 
Computational  Center  for  Humanistic  Research,  P.O.  Box  53,  University  of  Bergen 
N-5014,  Bergen,  Norway 
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Dead  Sea  Scrolls.  The  Centro  Atomazione  Analisi  Linguista  in  Gallarate,  Italy,  used 
computers  to  build  this  concordance,  which  helped  to  resolve  some  of  the  missing  or 
obliterated  words.   [Ref.  7] 

One  of  the  next  important  challenges  to  computational  linguistics  was  the 
issue  of  disambiguation.  Many  words  have  more  than  one  meaning  and  can  belong  to 
more  than  one  parts  of  speech.  An  example  in  English  is  the  word  "flies''  in  the 
following  two  sentences: 

Time  flies  like  an  arrow. 
Fruit  flies  like  an  apple. 

A  human  reader  can  easily  decide  that  "flies"  in  the  first  sentence  is  a  verb  and 
that  "flies"  in  the  second  sentence  is  a  noun.  This  decision  is  disambiguation.  In  order 
to  compute  such  characteristics  as  verb-noun  ratio,  it  is  first  necessary  to  disambiguate 
the  parts  of  speech.  Recent  advances  in  computational  linguistics  have  led  to 
programs  that  can  do  this  with  more  than  90%  success.  (Ref.  8]  [Ref.  9:  pp.  139  -  150] 
[Ref.  10] 

Scholars  continue  to  argue  about  the  usefulness  of  statistical  stylistics.  Efforts 
to  discover  and  prove  laws  of  distribution  have  not  met  with  unchallenged  success. 
Nevertheless,  the  work  done  in  the  past  130  years  has  laid  some  groundwork  for  the 
use  of  computers  to  study  and  analyze  written  prose.  All  of  the  professorial  attempts 
thus  have  led  to  success  at  a  humbler  level  :  the  tutorial.  The  history  of  stylistics  is 
now  culminating  at  this  level,  in  the  form  of  Computer  Assisted  Composition 
Instruction. 

2.  Computer  Assisted  Composition  Instruction  (CACI) 

CACI  is  a  new  discipline  which  has  begun  to  emerge  in  the  past  decade.  The 
success  of  word  processing  programs  created  a  growing  population  of  people  who 
expected  computers  to  help  them  to  write.  Enterprises  such  Bell  Laboratories  and 
centers  of  learning  such  as  The  Pennsylvania  State  University  began  to  borrow  from 
the  field  of  stylistics  as  they  developed  computer  programs  that  would  help  students 
and  workers  plan,  write  and  edit  prose.  Several  universities,  such  as  Colorado  State 
University,  successfully  instituted  CACI  as  part  of  their  composition  curriculum. 
[Ref.  11]  The  earliest  CACI  programs  required  a  mainframe  or  at  least  a  minicomputer. 
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With  the  expansion  of  the  power  of  personal  computers,  however,  software  houses 
such  as  DecisionWare  Inc.  began  to  write  CACI  programs  for  that  environment.  Over 
10,000  copies  of  DecisionWare's  RightWriter  are  now  in  use  at  corporations,  agencies 
and  universities  around  the  world,  according  to  their  advertisements.  Given  that  word 
processing  remains  the  single  most  common  use  of  personal  computers,  as  one 
megabyte  of  main  storage  becomes  commonplace  on  new  personal  computers,  style 
checkers  such  as  RightWriter  may  soon  become  as  popular  as  spell  checkers  have 
become  in  the  past  five  years. 

B.       SCOPE  OF  THE  STYLIST 

The  Stylist  is  a  style  checker  akin  to  Bell  Laboratory's  Writers  Workbench  and 
DecisionWare  Inc.'s  RightWriter.  It  does  not  help  the  user  to  plan  or  to  compose  his 
product.  Instead,  it  analyzes  the  finished  product  and  provides  that  analysis  to  the 
user  for  his  consideration  toward  revision.  The  Stylist  does  not  determine  parts  of 
speech,  as  does  Waiter's  Workbench;  nor  does  it  suggest  alternatives  to  hackneyed 
phrases,  as  does  RightWriter.  What  the  Stylist  does  instead  is  to  analyze  some  of  the 
characteristics  of  the  words  used  in  the  text.  By  doing  so,  The  Stylist  can  distinguish 
between  short,  vigorous,  germanic,  emotional  texts  and  long,  lazy  latin  texts.  The 
success  of  this  effort  could  and  should  be  incorporated  into  more  extensive  programs 
such  as  Writer's  Workbench. 

In  addition,  The  Stylist  solves  the  traditional  problem  of  creating  a  concordance 
in  a  novel  and  elegant  way.  This  solution  is  optimal  for  the  personal  computer 
environment  of  the  coming  few  years. 
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II.  RESEARCH  FOR  THE  STYLIST 

A.       LITERATURE  REVIEW 

When  I  first  conceived  of  The  Stylist,  I  believed  that  a  "style  checker"  was  a 
completely  original  idea.  Little  did  I  know  that  major  universities  and  great 
coportations  had  been  working  on  the  problem  for  decades.  My  research  began  with 
the  Encyclopedia  Brittanica,  where  I  discovered  the  existance  of  the  fields  of  stylistics 
and  statistical  linguistics.  Using  these  as  subjects,  I  searched  the  Dewey  decimal 
system  and  several  automated  data  bases  for  titles.  The  books  I  discovered  referenced 
the  two  journals  which  publish  many  of  the  pertinent  articles  :  Computational 
Linguistics,  and  Computers  and  the  Humanities.  These,  in  turn,  carried  advertisements 
for  some  of  the  currently  available  software.  They  also  identified  the  current  centers  of 
learning,  some  of  which  I  contacted  for  guidance  and  information.  The  experience  of 
researching  the  field  demoted  me  from  its  inventor  to  its  lowliest  tyro.  It  also  subjected 
me  to  many  ideas,  some  of  which  I  incorporated  into  The  Stylist.  The  bibliography 
lists  the  sources  which  I  unearthed  which  may  benefit  other  students.  In  this 
discussion,  I  will  cover  only  those  articles  which  directly  influenced  the  development  of 
The  Stylist. 

1 .  Affective  Tone 

C.W.  Anderson  and  G.E.  McMaster  reported  on  a  program  called  PSA 
(Psychological  Semantic  Analysis)  which  analyzes  the  emotional  tone  of  a  text. 
[Ref.  12]  They  built  upon  the  work  of  D.R.  Heise,  who  ascribed  values  of  "Evaluation, 
Activity  and  Potency"  to  the  thousand  most  commonly-used  words  in  the  English 
language.  [Ref.  13]  PSA  allows  the  user  to  enter  his  text  one  line  at  a  time;  if  any  word 
is  ambiguous,  PSA  asks  the  user  to  disambiguate.  PSA  then  matches  these  words  to 
the  1000  Heise-word  dictionary,  adding  values  up  to  create  a  profile  of  the  user's  text. 
Table  1  helps  clarify  the  meanings  of  the  three  Heise-word  categories. 

After  building  and  testing  PSA,  Anderson  and  McMaster  concluded  that: 
"The  affective  tone  of  whole  passages  can  be  measured  by  computer-assisted  collection 
of  .  .  .  scores  of  those  words  ...  for  which  Heise  (1965)  has  provided  semantic 
meanings."   [Ref.  12] 
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TABLE  1 

HEISE  WORDS  VALUES 

EVALUATION 

High 

Neutral 

church 
God 

beauty 
pleasant 

experiment 
prove 
mountain 
suggest 

ACTIVITY 

High 

Neutral 

fire 
great 
attack 
fight 

well 
know 
last 
presence 

POTENCY 

High 

Neutral 

steel 
iron 
rock 
hard 

hear 
rich 
all 
indicate 

Low 

war 
fire 
disease 
bad 


Low 

dead 
silent 
sleep 
rock 


Low 

love 
kiss 
baby 
wife 


The  original  idea  of  The  Stylist  envisioned  doing  just  such  a  count  of  words 
chosen  for  their  frequency  of  use  and  their  characteristics  such  as  etymology,  emotional 
connotation  and  vigor.  Anderson  and  McMaster  convinced  me  that  this  method  had  a 
sound  psychological  base.  Moreoever,  Heise's  categories  of  Evaluation  and  Activity 
seemed  to  correspond  directly  to  my  envisioned  categories  of  emotional  connotation 
and  vigor.  His  category  of  Potency,  however,  struck  me  and  continues  to  strike  me  as 
distressingly  Freudian.  Because  The  Stylist  would  not  use  that  category  and  because  it 
would  use  others,  I  decided  not  to  borrow  Heise's  words  and  values,  but  rather  to  build 
my  own  dictionary  and  ascribe  my  own  values.  This  seemed  a  reasonable  approach, 
given  the  statement  by  Anderson  and  McMasters  that  "there  is  much  commonality  in 
the  emotional  response  of  different  persons  to  words  and  objects." 
2.  Readability  and  Sentence  and  Word  Length 

Two  of  the  most  common  stylistic  measures  are  sentence  length  and  word 
length.  The  most  common  use  of  these  measures  is  to  determine  readability.  The  basic 
idea  is  that  short  sentences  are  easy  to  read  and  long  sentences  are  hard  to  read. 
Various  formulae,  such  as  the  Kincaid,  the  Automated  Readability  Index  (ARI),  the 
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Colcman-Eiau,  and  the  Flcsch  Reading  Ease  Score,  [Ref.  10]  attempt  to  determine 
readability  as  a  straight-forward  function  of  sentence  and  word  length.  These  formulae 
have  had  considerable  impact  on  the  teaching  of  English.  Most  style  checkers  use 
them.  Some,  like  RightWriter,  admonishes  the  writer  any  time  a  sentence  grows 
beyond  22  words  or  so.  Having  read  a  great  deal  of  Victorian  novels  and  Madison 
Avenue  copy,  I'm  well  aware  that  the  ideal  20th  century  sentence  is  short.  It's 
vigorous.  Easy  to  understand.  Maybe  even  dispenses  with  its  verb  as  it  rushes  toward 
its  punctuation!  The  mechanization  of  this  modern  tendency  into  simplistic  formulae, 
however,  seems  to  me  pernicious.  A  sentence  should  be  the  unit  of  a  complete  idea.  If 
we  limit  ourselves  to  short  sentences,  we  may  be  limiting  ourselves  to  small  ideas.  The 
average  sentence  of  the  prose  of  the  Age  of  Reason  was  45  words.  If  Samuel  Johnson 
were  to  live  today,  no  one  would  let  him  finish  a  sentence.  My  distrust  of  these 
formulae  redoubled  when  I  read  "Readability  is  a  Four- Letter  Word,"  by  Jack  Selzer. 
[Ref.  14]  In  this  article,  Selzer  forcefully  makes  the  points  that  readability  is  a 
subjective  quality  which  is  greatly  influenced  by  factors  such  as  arrangement  of  ideas, 
reader  background  and  interest,  and  difficulty  of  vocabulary.  The  Stylist  does  not  use 
readability  formulae.  It  does  Hag  sentences  which  seem  to  be  run-on.  More 
importantly,  it  gauges  the  difficulty  of  the  vocabulary.  If  the  writer  uses  hard  words, 
long  words,  latinate  words  and  long  sentences,  The  Stylist  warns  him,  particularly  if  the 
work  is  fiction. 

B.       PRODUCT  TESTING 

1.   RightWriter 

As  part  of  my  research,  I  procured  a  copy  of  RightWriter*  and  tested  it  with 
some  of  my  own  writing.  Overall,  the  program  impressed  me  with  its  capabilities  and 
its  engineering.  RightWriter  reproduces  the  users  text  with  inserted  comments.  Its 
constant  challenge  of  long  or  complex  sentences  forced  me  to  reexamine  each  case  in 
particular.  Several  lengthy  sentences  became  two  short  ones.  It  never  let  me  begin  a 
sentence  with  "But,"  a  foible  of  mine.  It  applauded  my  writing  when  it  was  strong  and 
it  derided  it  when  it  was  pompous.  One  attractive  feature  was  the  production  of  a 
alphabetical  word  list  with  frequency  of  occurrence.  This  feature  demonstrated  my 
ovcrfondness  for  the  words,    "ancient"  and  "thousand".      RightWriter  also  suggested 


-RightWriter  (tm)  is  a  product  of  Decisionwarc  Inc.,  2033  Wood  St.,  Suite  218 
Sarasota,  IE  33577,  (313)  952-921 1. 
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substitutes    for    hackneyed    or    useless    phrases    such    as    "the    fact    that".     Overall, 
RightWriter  proved  itself  extensively  useful. 

2.  PC-Style 

PC-Style3  is  a  much  less  ambitious  program  than  RightWriter.  PC-Style 
required  only  40K  of  RAM,  compared  to  192K  for  RightWriter.  PC-Style  is  also  much 
cheaper,  costing  only  S29.95  as  compared  to  RightWriter's  S95.  I  tested  it  with  the 
same  test  data  set  I  had  used  on  RightWriter.  PC-Style  has  a  nice  human-factor 
feature,  in  that  it  constantly  displays  to  the  user  how  much  it  has  done  and  how  far  it 
is  from  finishing.  RightWriter  is  more  a  coffee-break  program  :  you  execute  it  and 
then  you  go  make  some  codec.  PC-Style,  however,  had  little  to  recommend  itself 
besides  this  feature.  It  relies  upon  a  readability  formula.  It  also  attempts  some 
affective  modeling,  based  upon  its  dictionary  of  50  action  verbs.  This  miniscule 
dictionary  is  inadequate  for  the  task.  A  match  of  less  than  two  percent  of  my  input 
text  with  these  50  words  was  typical;  such  a  small  sample  is  inadequate  to  qualify  the 
vigor  of  a  passage.  Another  simplistic  but  more  valuable  tactic  of  PC-St\le  was  to 
count  the  frequency  of  "personal'"  words,  such  as  "I",  "you"  and  "we".  Although  the 
conventions  of  the  more  stuffy  forms  of  writing  forbid  them,  it  is  generally  accepted 
that  most  technical  writing  benefits  from  direct,  personal  pronouns.  PC-Style 
reinforces  the  clarity  and  forccfulness  of  direct  rhetoric.  Despite  these  few  nice 
features,  PC-Style  is  a  too  simplistic  to  be  of  lasting  utility. 

3.  Writer's  Workbench 

Although  I  was  unable  to  experience  Writer's  Workbench,  I  did  obtain 
enough  research  materials  to  form  an  impression  of  its  utility.  [Rcf.  10]  Writer's 
Workbench4  is  actually  a  complex  of  32  programs.  Together,  these  programs  provide 
more  than  all  the  1'caturcs  of  RightWriter.  STYLE  calculates  readability,  using  the 
prewously-discussed  formulae.  It  also  analyzes  sentence  type  (simple,  complex  or 
compound).  STYLE  is  able  to  disambiguate  the  words  of  the  input  text  with  95% 
accuracy.  It  then  analyzes  the  use  of  verbs  and  modifiers.  If  a  passage  relies  too 
heavily  on  the  passive  voice  or  it  is  fat  with  modifiers,  STYLE  warns  the  user.  The 
follow-on  program.  DICTION,  detects  hackneyed  phrases.  SL'GGLST  suggests 
replacements.   Overall,  Writer's  Workbench  appears  to  be  the  industry  standard. 


3PC-Style  (tm)  is  a  product  oi'  ButtonWare  Inc.,  P.O.  Box  57S6,  Bellcvue,  WA 
98006,  (206)454-0479. 

Writer's  Workbench  (tm)  is  a  product  of  Bell  Laboratories,  Murray  Hill,  New 
Jersy,  07974. 
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C.       CONCLUSIONS  BASED  UPON  RESEARCH 

Now  I  knew  that  I  would  not  be  writing  the  first  style  checker.  Nor  would  I  be 
writing  the  second,  third,  fourth  or  fifth.  Some  solace  could  be  found  in  the  fact  that 
the  previous  programs  relieved  me  of  the  need  to  incorporate  all  stylistic  features  into 
The  Stylist.  I  could  extend  the  affective  modeling  of  PSA  by  including  other  values  of 
words.  Three  of  these  new  values  could  be  more  objective-etymology,  difficulty  and 
tangibility.  Together  with  vigor  and  emotional  connotation,  these  values  should  be 
able  to  describe  a  profile  of  the  user's  input  text. 

Additionally,  I  could  increase  the  accuracy  of  such  style  checkers  by  pointing  out 
the  need  to  make  allowances  for  the  genre  of  the  user's  input  text.  Any  reader  of 
technical  as  well  as  fictional  writing  knows  that  the  characteristics  of  these  two  types 
often  vary  more  widely  than  the  style  of  writers  within  the  type.  Comparative  style 
analysis,  therefore,  would  clearly  seem  to  need  to  take  the  genre  into  account. 
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III.  DESIGN  OF  THE  STYLIST 

A.       CHARACTERISTICS  OF  WORDS 

The  fundamental  idea  of  The  Stylist  is  that  individual  words  have  power.  The 
denotation  of  a  word  is  its  meaning.  The  connotation  of  a  word  is  its  emotional 
impact.  For  example,  "pupil",  "student",  and  "scholar"  all  denote  a  person  who  studies 
and  learns.  The  emotional  connotations,  however,  range  from  the  humble  "pupil" 
through  the  familiar  "student"  to  the  lofty  "scholar".  Besides  these  emotional 
connotations,  words  have  other  intrinsic  values  which  can  be  quantified.  The  Stylist 
would  concentrate  on  the  following  values:  Etymology,  Tangibility,  Difficulty, 
Emotional  Connotation  and  Vigor. 

1.  Etymology 

One  of  the  beauties  of  the  English  language  is  that  its  vocabulary  embraces 
two  main  sources:  native  and  borrowed.  Our  native  words  mainly  come  to  us  from  the 
Anglo-Saxon  tongues.  Borrowed  words  come  from  Latin,  mostly  by  way  of  the  French 
of  the  Norman  conquest.   The  following  table  illustrates  the  differences  : 


TABLE  2 
EXAMPLES  OF  NATIVE  AND  BORROWED  WORDS 

Native  Words  Borrowed  Words 

Man  Person 

Finger  Digit 

Thinker  Philosopher 

Fire  Conflagration 

Book  Volume 

Dirtbag  Miscreant 


Native  words  are  short,  strong  and  rough.  Borrowed  words  tend  toward 
length,  gentility  and  elegance.  Good  English  prose  (particularly  good  fiction)  favors 
native  words.  Poor  English  prose  (particularly  bad  technical  writing)  exhibits  a 
tendency  to  overutilize  latinate  etymology.  By  counting  up  the  number  of  native  and 
borrowed  words  in  a  user's  input  text,  The  Stylist  could  see  how  it  compares  to  good 
writing  of  the  appropriate  genre. 
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2.  Tangibility 

A  word  can  either  evoke  an  image  of  a  thing  or  it  can  refer  to  an  idea.    I  call 
the  former,  "tangible"  and  the  latter,  "intangible".    The  following  table  illustrates  the 

difference: 


TABLE  3 
EXAMPLES  OF  TANGIBLE  AND  INTANGIBLE  WORDS 


TANGIBLE         INTANGIBLE 
Rock  Ethereal 


Lips  Automatically 

Beehive  Then 

Corvette  Rely 

Hammer  Preliminary 


Tangible  words  are  concrete,  exact  and  evocative.  Intangible  words  are  ideal, 
general  and  cognitive.  All  writing  uses  both.  Good  writing  usually  takes  advantage  of 
tangible  words.  Even  the  most  philosophical  writing  benefits  from  the  use  of  tangible 
words.  (See  Appendix  E  for  an  analysis  of  a  Platonic  dialogue).  Good  fiction  writing 
rarely  strays  too  far  toward  the  intangible.  The  Stylist  could  count  the  tangible  and 
intangible  words  of  the  user's  input  text  and  compare  this  count  to  good  writing  of  the 
appropriate  genre. 
3.  Difficulty 

This  is  one  of  the  most  indisputable  characteristics  of  words.  For  the 
purposes  of  The  Stylist,  I  defined  four  levels  of  difficulty  :  Elementary,  High  School, 
Graduate  and  Postgraduate.  The  words  of  each  level  are  those  most  likely  to  be  used 
in  speech  with  ease  by  an  average  person  of  that  educational  level.  Although  I'm 
aware  that  reading,  writing  and  speaking  vocabularies  are  different,  I  contend  that  a 
person  reading  a  word  within  his  reading  vocabulary  but  outside  of  his  speaking 
vocabulary  often  must  pause  for  a  mental  translation  into  more  simple  terms.  The  act 
of  this  pausing  raises  the  difficulty  level  of  the  text.  Table  4  illustrates  the  categories. 
These  examples  are  based  upon  my  judgements. 

By  counting  the  occurrence  of  difficult  words,  The  Stylist  could  determine  the 
overall  difficulty  of  the  vocabulary  and  thus  the  readibility  of  the  text. 
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TABLE  4 

EXAMPLES  OF  WORDS  OF  VARIOUS  DIFFICULTY 

ELEMENTARY 

HIGH  SCHOOL 

GRADUATE 

POSTGRADUATE 

Big 

Tardy 

Matrix 

Execrable 

Sister 

Rendezvous 

Immaterial 

Parsimony 

Spoon 
Flying 

Transmission 

Tonality 

Recursive 

Foreman 

Universal 

Homomorph 

Wish 

Process 

Induction 

Dilatory 

Done 

Joyous 

Processor 

Zygote 

Handsome 

Undergo 

Linear 

Synergy 

4.  Emotional  Connotations 

The  Stylist  would  have  five  categories  of  emotional  connotation  :  Sublime, 
Pleasant,  Neutral,  Unpleasant  and  Horrid.  The  following  table  illustrates  the 
categories: 


TABLE  5 

EXAMPLES  OF  WORDS  OF  VARIOUS  EMOTIONAL  CONNOTATIONS 

SUBLIME    PLEASANT 

NEUTRAL 

UNPLEASANT 

HORRID 

Beauty    Happy 

The 

Damage 

Cancer 

Sunrise    Food 

Which 

Loss 

Murder 

Victory   Friendly 

Brick 

Insulting 

Whore 

Love       Warm 

Is 

Loser 

Fuck 

God       Helpful 

Name 

Cost 

Death 

Paradise   Sex 

When 

Wound 

Traitor 

By  counting  the  use  of  these  types  of  words,  The  Stylist  could  determine  the  overall 
emotional  tone  of  the  passage.    It  could  also  detect  flat  or  emotional  writing. 
5.  Vigor 

A  related  but  distinct  characteristic  is  vigor.  Sublime  and  horrid  words  tend  to 
be  highly  vigorous,  but  not  all  highly  vigorous  words  are  emotional.  Examples  of 
vigorous  but  unemotional  words  are  "sprint",  "rush"  and  "cross".  This  category  also 
tends  to  be  more  objective.  For  example,  the  word,  "soldier"  can  have  widely  different 
connotations  for  different  people.  The  word  "soldier"  would  please  a  career  Army 
officer  but  it  would  displease  a  survivor  of  the  Japanese  occupation  of  Canton.  Both 
could  agree  that  "solider"  is  a  vigorous  word. 
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TABLE  6 

EXAMPLES  OF  WORDS  OF  VARIOUS  VIGOR 

VIOLENT 

ENERGETIC 

CALM 

INERT 

Destroy 

Sprint 

Read 

From 

Creation 

Dive 

Write 

Into 

Fire 

Discipline 

Manager 

Something 

Atomic 

Wedding 

Ocean 

Comma 

Holocaust 

Steam 

Blue 

Paper 

Conqueror 

Flying 

Sought 

Format 

By  calculating  the  vigor  of  the  words  of  the  user's  input  text,  The  Stylist 
would  be  able  to  estimate  its  overall  strength. 

B.  THE  NEED  FOR  A  DICTIONARY 

To  quantify  the  above  five  aspects  of  the  words  of  the  user's  input  text,  it  would 
be  necessary  to  maintain  a  dictionary  of  the  most  commonly  used  words  and  their 
values.  The  original  idea  of  The  Stylist  forsaw  just  such  a  dictionary.  After  reading 
Anderson  [Ref.  12]  and  testing  PC-Style,  the  idea  seemed  less  original  but  still  valid. 
The  heart  of  The  Stylist,  therefore,  would  be  its  dictionary7.  Before  I  began  any  top- 
down  designing,  I  first  wanted  to  explore  the  technical  challenges  and  possible  pitfalls 
of  building  such  a  dictionary.  A  review  of  data  structure  literature  convinced  me  that 
two  approaches  were  the  most  feasible:  hashing  and  Binary  Search  Tree.  Since  I  had 
already  decided  that  The  Stylist  would  also  produce  a  concordance,  I  had  to  take  into 
account  the  need  for  efficient  alphabetical  traversal  of  the  words  of  the  user's  input 
text.  My  intuition  that  hashing  would  be  preferable  for  dictionary  look-up  but  would 
not  lend  itself  easily  to  concordance-building  was  confirmed  by  a  passage  in  an 
excellent  text  by  Donald  E.  Knuth.  [Ref.  15:  p.  540]  Having  decided  to  implement  the 
dictionary  with  a  Binary  Search  Tree  (BST),  I  began  to  build  a  series  of  prototypes. 

C.  THE  COPYDIX  PROTOTYPES 

CopyDixl  through  CopyDix6  were  early  experiments  in  using  a  Binary  Search 
Trees  for  dictionary  lookup.  Naturally,  the  word  was  the  key  value.  The  experience  of 
the  CopyDix  series  taught  me  the  following  lessons: 
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1 .  The  Benefit  of  Preorder  Storage 

The  structure  of  a  Binary  Search  Tree  is  maintained  during  execution  as  a 
system  of  pointers.  These  pointers  refer  to  locations  in  memory  which  pertain  only  to 
that  execution.  If  a  dictionary  is  modified  during  execution  (for  example,  if  new  words 
are  added),  then  the  BST  should  be  stored  to  its  file  in  preorder.  An  inorder  storage 
seems  the  most  logical  way  but  it  is  actually  the  worst.  If  a  BST  is  stored  using  an 
inorder  traversal,  the  next  time  it  is  loaded  it  will  be  that  least  bushy  of  all  BSTs:  a 
linked  list.  Storage  using  preorder  will  cause  the  next  loading  of  the  tree  to  duplicate 
the  last. 

2.  The  Need  for  AVL 

Even  with  preorder  storage,  the  dictionary  needed  the  capability  to  change:  to 
grow  or  shrink  as  the  user  desired.  Any  particular  office  or  curriculum  has  its  own 
special  vocabulary.  I  wanted  to  give  The  Stylist  the  ability  to  adapt  its  dictionary  to 
the  vocabulary  of  its  user's  environment.  Without  some  mechanism  for  rebalancing 
the  BST  after  the  insertion  of  new  words  or  the  deletion  of  unwanted  words,  the  BST 
could  become  increasingly  lopsided.  Searching  for  words  in  such  a  lopsided  tree  would 
become  inefficient.  Clearly,  an  AVL  scheme  was  required.  I  adapted  an  AVL  insertion 
routine  from  an  excellent  text  by  Niklaus  Wirth.   [Ref.  16:  pps.  220  -  221] 

3.  Storage  requirements 

The  CopyDix  series  allowed  me  to  see  whether  a  BST  of  5000  words  and 
associated  values  would  fit  into  the  main  storage  available  to  a  healthy  personal 
computer.  I  determined  that  1.5  megabytes  would  be  sufficient  for  The  Stylist  and  its 
data  structures.  The  Copydix  series  allowed  me  to  create  such  a  structure  and  to  prove 
that  it  would  not  require  more  than  one  megabyte. 

4.  Text  Processing 

The  CopyDix  series  also  identified  some  unforseen  complexities  in  processing 
text.  For  example,  in  order  to  include  contractions,  it  was  necessary  to  include  the 
single  quotation  mark  (')  in  the  dictionary.  Waterloo  Pascal  uses  this  character  to 
define  the  beginning  and  end  of  user-defmed  constants  and  strings.  To  include 
contractions,  it  was  necessary  to  declare  "Succ('@')"  instead  of  (')!  There  were  a  few 
other  similar  problems  requiring  equally  unhappy  solutions. 

5.  Building  a  Dictionary 

Finally,  the  CopyDix  series  allowed  me  to  begin  to  build  a  dictionary  of  most 
frequently  used  words.  I  ran  the  text  of  two  novels  and  several  technical  articles 
through  the  Copydix  series,  accumulating  a  dictionary  of  over  2000  words. 


D.       CONCORDANCE  SEARCH  TREE  (CST) 
1.  Original  Concept 

Based  upon  the  experience  of  the  CopyDix  series,  I  began  to  play  around  with 
the  various  ways  of  building  a  concordance.  The  literature  provided  two  examples  of 
how  the  problem  has  already  been  solved.  One  solution  was  to  create  a  search  tree. 
The  key  value  of  the  search  tree  was  the  word;  two  associated  tables,  TOKEN  and 
TYPE,  recorded  the  information  about  the  sequence  of  occurrence  of  the  words. 
[Ref.  7:  pp.  186  -  191]  Another  solution  familiar  to  most  computer  scientists  was  the 
Key  Words  in  Context  (KWIC)  program  designed  by  Parnas.  [Ref.  17]  This  solution 
creates  a  KWIC  listing  by  circular  shifting  each  line  and  then  sorting  each  line;  such  a 
KWIC  listing  is  similar  to  a  concordance. 

I  wanted  to  create  a  concordance  without  relying  upon  any  external  storage. 
My  experience  with  the  CopyDix  series  had  shown  that  text  processing  is  both  I,  O  and 
computation  bound:  1,0  bound  because  large  text  files  must  be  loaded;  computation 
bound  because  each  word  must  be  processed  character  by  character.  I  wanted  the 
solution  to  eliminate  the  need  for  further  input  or  output  (such  as  using  secondary 
storage  to  build  files  containing  KWIC  lines).  I  also  wanted  to  minimize  computation 
by  allowing  The  Stylist  to  remember  the  order  of  occurrence  of  each  word  of  the  input 
text,  without  having  to  recompute  it.  I  wanted  my  solution  to  flow  as  naturally  as 
possible  from  the  process  of  looking  each  word  up  in  the  dictionary.  I  didn't  want  to 
have  to  process  each  word  twice,  or  to  search  for  the  position  of  any  word  twice. 
Ideally,  the  Stylist  would  have  enough  main  storage  available  that  it  could  retain  all 
important  information,  without  resorting  to  recomputation  or  to  secondary  storage. 
These  ideas  led  me  play  to  around  with  various  ways  of  using  a  BST  to  record  the 
order  of  occurrence  of  the  words  of  the  user's  input  text.  One  obvious  solution  would 
be  to  have  each  node  of  the  BST  contain  pointers  to  associated  data  structures  such  as 
an  array  of  linked  lists  containing  the  sentences  of  the  input  text.  Such  a  solution, 
however,  would  require  storing  the  words  twice.  CopyDix  had  shown  that  words  take 
up  a  great  deal  of  storage  space.  Storing  them  twice  would  be  wasteful.  With  these 
ideas  in  mind,  I  lit  upon  the  idea  of  the  Concordance  Search  Tree  (CST). 

A  CST  is  a  Binary  Search  Tree  with  a  linked  list  threaded  through  it.  This 
linked  list  is  two-way,  connecting  each  word  of  the  BST  to  the  word  used  before  it  and 
to  the  word  used  after  it.  An  inorder  traversal  of  the  BST  visits  each  word  in 
alphabetical  order.    During  each  visit,  a  traversal  of  the  linked  list  in  the  "backwards" 
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direction  would  encounter  all  the  words  used  before  that  word.  Printing  these  words, 
then  printing  the  key  word  being  visited,  and  finally  printing  the  words  in  the 
"forwards'"  direction  creates  a  concordance. 

This  solution  offers  some  exciting  facilities.  It  allows  the  user's  input  text  to 
be  accessed  in  any  number  of  ways.  If  a  concordance  of  three,  five,  seven  or  thirteen 
words  per  line  is  desired,  changing  a  few  global  constants  immediately  fulfills  that 
desire.  If  only  the  sentences  containing  a  certain  word  are  desired,  a  insert  search  of 
the  CST  keying  on  that  word,  and  then  a  concordance  traversal  on  that  node,  provides 
all  those  sentences.  In  short,  a  CST  is  a  complex  structure  with  a  challenging  but 
strong  intuitive  appeal.    It  provides  a  great  deal  of  flexibility  to  the  processing  of  text. 

One  added  wrinkle  of  complexity  lies  in  the  fact  that  many  words  of  the  input 
text  occur  more  than  once.  The  word,  "the",  for  example,  occurs  many  times.  To 
guarantee  a  correct  concordance  traversal  of  the  CST,  each  occurrence  of  the  word 
"the''  must  be  associated  with  a  unique  pointer.  This  requirement  is  satisfied  by 
creating  a  unique  node  of  pointers  associated  with  each  occurrence  of  the  word.  The 
main  node  of  the  CST  contains  the  word,  its  values,  and  left  and  right  pointers.  It  also 
contains  a  pointer  called  "down",  which  points  to  the  linked  list  of  unique  nodes  of 
pointers.  The  first  such  node  pertains  to  the  first  occurrence  of  the  word,  "the",  in  the 
user's  input  text.  It  points  to  the  word  used  before  and  the  word  used  after  that 
occurrence  of  the  word  "the".  (These  pointers  are  called  "last"  and  "next".)  It  also 
points  "down"  to  the  second  occurrence  of  the  word  "the".  So  the  Stylist  creates  a 
concordance  through  the  following  traversals:  an  inorder  traversal  of  the  CST  visits 
each  main  node  in  alphabetical  order.  During  that  visit,  The  Stylist  goes  down  the 
linked  list  of  the  occurrences  of  that  word.  During  each  step  down  the  linked  list,  The 
Stylist  traverses  "lastward"  and  then  "nextward"  to  print  out  the  line  of  that 
occurrence.  When  The  Stylist  reaches  the  bottom  of  the  downward  list,  it  continues 
the  inorder  traversal. 

2.  The  Tril  -  Tri9  Prototypes 

To  examine  the  feasibility  of  the  CST  concept,  I  wrote  a  series  of  progressively 
more  capable  prototypes  called  Tril  through  Tri9.  (The  names  refer  to  a  "Trinary 
Search  Tree".)  I  found  that  I  could  implement  all  the  procedures  necessary  to  build 
and  traverse  a  CST  in  only  137  lines  of  code.  Moreover,  the  performance  of  the 
concept,  both  in  terms  of  main  storage  and  processor  requirements,  seemed  extremely 
satisfactory. 
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E.       DESIGN  OF  THE  STYLIST 

Having  used  a  series  of  prototypes  to  identify  and  solve  the  critical  issues,  I  set 
them  aside  and  began  to  design  The  Stylist.  After  jotting  down  some  logic  flowcharts 
and  data  flow  diagrams,  I  decomposed  the  problem  into  three  main  modules  which  I 
called  "Reader",  "Researcher"  and  "Reporter". 

1.  The  Reader  Module 

Only  Reader  has  access  to  the  user's  input  text.  It  reads  that  text  and  passes 
along  a  data  structure  called  "readnode",  which  is  merely  the  word  and  its  length.  The 
Reader  also  signals  the  ends  of  sentences  and  the  end  of  the  file. 

2.  The  Researcher  Module 

Researcher  receives  the  "readnodes",  researches  the  qualities  of  that  word, 
builds  a  profile  of  the  user's  input  text  based  upon  the  tally  of  the  qualities  of  those 
words,  and  then  passes  that  profile  on  to  the  Reporter  Module.  If  the  user  wants  one, 
Researcher  also  creates  the  concordance.  Only  Researcher  has  access  to  the  dictionary 
file  and  to  the  CST.  Neither  Reader  nor  Reporter  know  how  the  dictionary  is 
implemented  or  how  the  concordance  is  produced.  This  is  the  most  crucial  instance  of 
information  hiding  in  the  design. 

3.  The  Reporter  Module 

Reporter  receives  the  profile,  which  it  compares  to  the  profiles  of  other  texts. 
Based  upon  this  comparison,  it  passes  on  recommendations  and  commendations  to  the 
user. 

4.  Low  Level  Design 

Having  decomposed  the  problem  into  these  three  modules,  I  jotted  down  a 
plan  for  the  procedures  that  would  comprise  them.  This  plan  included  the 
input  output  and  operations  of  each  procedure.  This  low  level  design  included  all  the 
parameters  of  the  procedures  and  identified  the  hierarchy  between  the  procedures  and 
modules. 

One  of  the  important  features  built  into  this  low  level  design  was  the  human- 
factor  facility  of  reporting  to  the  user  the  progress  of  the  execution  of  The  Stylist.  The 
need  to  send  messages  to  the  screen  telling  the  user  what  was  happening  required 
counting  steps  and  further  input/ output,  but  my  experiences  with  Right  Writer,  PC- 
Style  and  the  CopyDix  prototypes  had  convinced  me  that  the  expense  was  worth  it. 
Psychologically,  staring  at  a  blank  screen  for  five  minutes  seems  ten  times  as  long  as 
reading  a  dynamic  screen  for  seven  minutes.   To  provide  the  user  the  option  of  quicker 


but  more  boring  execution  over  slower  but  more  engaging  execution,  I  designed  a 
facility  that  asked  him  whether  he  wanted  frequent  reports,  occasional  reports  or  one 
report  when  done. 

With  this  low  level  design  in  hand,  I  was  ready  to  begin  to  code. 
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IV.  IMPLEMENTATION  OF  THE  STYLIST 

A.       CODING 

Because  of  the  experience  gained  from  the  CopyDix  prototypes  and  because  of 
the  top-down,  modular  design,  coding  the  Reader  and  Researcher  modules  took  only 
two  weeks.  Prototyping  and  top-down  design  allowed  me  to  code  quickly  and  with  few- 
errors.  (The  usefulness  of  these  software  engineering  techniques  would  also  prove  itself 
in  the  testing  phase.)  As  I  had  the  CopyDix  series,  I  implemented  The  Stylist  in  a 
series  of  iterative  steps  called  Style  1  through  Style9. 

Style6  brought  to  light  light  an  unforseen  problem:  the  interaction  between  the 
facilities  of  adding  new  words  to  the  dictionary  and  creating  a  concordance.  The 
design  called  for  allowing  the  user  to  decide  whether  he  wanted  to  add  new  words  to 
the  dictionary  and  whether  he  wanted  a  concordance.  These  two  choices  created  four 
cases:  Growing  Dictionary  and  Concordance;  Growing  Dictionary  and  no 
Concordance;  Static  Dictionary  and  Concordance;  Static  Dictionary  and  no 
Concordance.  In  the  case  of  Static  Dictionary  and  Concordance,  to  create  a 
concordance,  new  words  found  in  the  user's  input  text  would  have  to  be  added  to  the 
CST.  But  since  the  user  had  chosen  not  to  add  new  words  to  the  dictionary,  The 
Stylist  could  not  prompt  him  for  the  values  of  these  new  words.  This  was  not  a 
problem  until  the  same  new  word  was  used  again.  Researcher  would  find  this  new 
word  in  the  CST  but  would  not  find  any  values  for  the  word.  This  would  cause  a  fatal 
error:  "Word. etymology  has  unassigned  value."  To  avoid  this  problem,  it  was 
necessary  to  add  a  new  field  to  each  of  the  nodes:  Status.  The  status  of  a  node  could 
be:  "Valuable",  which  meant  that  the  node  contained  values  for  the  other  fields; 
"NotValuable",  which  meant  it  didn't. 

This  particular  problem  grew  even  more  gnarly  when  I  decided  to  extend  the 
Look_Up  procedure  of  Researcher.  The  orginal  design  called  for  Look_L'p  to  try  to 
match  the  input  word  only  with  its  exact  equal  in  the  dictionary.  This  worked  fine,  but 
it  meant  that  the  dictionary  would  have  to  contain  all  the  following  variants  of  "look"  : 
"look",  "looks",  "looked",  and  "looking".  The  extension  created  a  new  procedure  called 
Look_L'p_Variants.  If  the  input  word  didn't  match  any  word  in  the  dictionary, 
Researcher  looked  up  variants  of  the  word.    If  it  found  one,  the  values  of  the  variant 
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were  added  to  the  profile.  This  extension  allowed  a  more  compact  dictionary  and  it 
increased  the  probability  that  most  words  would  be  found,  but  it  complicated  the 
problem  described  in  the  previous  paragraph.  In  the  case  of  Growing  Dictionary  and 
Concordance,  a  word  like  "looked"  would  have  to  be  added  to  the  CST  for  the  sake  o[ 
a  concordance,  but  now  it  would  also  be  added  to  the  dictionary.  Look_up_variants 
would  thus  eventually  gum  up  the  dictionary  with  a  lot  of  useless  variants.  Solving  the 
problem  meant  adding  a  third  status.  The  three  statuses  were  now  :  "Storable",  which 
meant  the  node  was  a  first  class  citizen,  a  word  which  had  values  and  should  be  stored 
to  the  dictionary;  "Not_Storable",  which  meant  that  the  node  was  a  second  class 
citizen,  a  variant  word  which  had  values,  but  should  not  be  stored  to  the  dictionary; 
and  "Not_Valuable",  which  meant  the  word  was  a  third  class  citizen,  a  new  word  which 
had  no  values,  nor  should  be  stored  to  the  dictionary. 

The  solution  of  these  problems  completed  the  coding  phase  of  the  Reader  and 
Researcher  modules.  The  Reporter  module  remained  a  stub.  The  Stylist  could  not 
make  recommendations  and  commendations  to  the  user  until  it  had  established  its 
dictionary  and  built  up  a  history  of  other  profiles. 

B.       BUILDING  THE  DICTIONARY 

The  CopyDix  series  and  Style  1  through  Style 7  had  built  up  a  dictionary  of  some 
3000  words,  which  contained  about  80%  or  85%  of  the  words  of  any  given  text.  Each 
of  these  3000  words,  however,  now  required  the  assignation  of  five  values  :  etymology, 
tangibility,  difficulty,  emotional  connotation  and  vigor.  That  totals  to  15,000  values. 
After  experimenting,  I  developed  a  system  that  allowed  me  to  enter  those  15.000  values 
in  two  weeks.  First,  I  sorted  the  dictionary  alphabetically  and  printed  out  a  listing. 
Using  Funk  and  Wagnall's  Collegiate  dictionary,  I  checked  the  etymology  and 
highlighted  the  borrowed  words  on  the  print-out.  Then  I  wrote  a  program  called 
AUTOMATE,  which  displayed  a  screen  for  each  of  the  five  values  of  each  word  and 
required  one  keystroke  to  assign  a  value.  Using  this  system,  I  assigned  values  to  all 
3000  words,  going  at  a  speed  that  relied  upon  a  subjective  reaction  similar  to  the 
reaction  of  a  reader  scanning  a  text  at  a  comfortable  reading  pace.  Then  I  wrote  two 
new  programs,  DIXSPLIT  and  DIXJOIN.  DIXSPLIT  split  the  dictionary  into  17 
intersecting  subsets  :  all  native  words,  all  borrowed  words,  all  tangible  words,  all 
intangible  words,  and  so  on.  Each  of  these  subsets  was  like  a  formation  of  soldiers. 
Any  word  that  didn't  belong  in  that  subset  stuck  out  conspicuously.  After  correcting 
these  mistakes.  I  executed  DIXJOIN,  which  brought  the  amended  dictionary  together. 

Now  I  was  ready  to  begin  testing  The  Stylist. 
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V.  TESTING  OF  THE  STYLIST 

A.       INITITAL  TEST  DATA  SET 

The  initial  test  data  set  for  The  Stylist  included  both  fiction  and  technical  writing. 
Fiction  was  represented  by  long  passages  from  two  of  my  novels  and  1000- word 
excerpts  from  the  novels  of  Ernest  Hemingway,  Raymond  Chandler  and  Kurt 
Vonnegut.  Technical  writing  was  represented  by  about  a  dozen  student  essays  and 
excerpts  from  three  textbooks  on  computer  science.  In  both  categories,  I  included 
both  good  and  poor  writing.  I  ran  all  the  texts  through  Style8,  the  Reporter  module  of 
which  merely  printed  out  the  values  of  their  profiles.  Style8  performed  robustly.  I 
executed  it  under  all  four  cases  pertaining  to  the  dictionary  and  concordance  as  well  as 
under  all  three  cases  pertaining  to  the  frequency  of  execution  status  reports.  Style8 
revealed  no  major  flaws.  Its  shortcomings  were  well  within  the  scope  of  the  original 
design.  (See  Appendix  A,  Suggested  Extensions  to  The  Stylist).  StyleS  suggested  some 
fine-tuning  changes. 

Two  changes  dealt  with  the  part  of  the  CST  scheme  called  the  FourDix.  The 
FourDix  contains  the  most  common  function  words,  all  of  which  are  four  letters  or  less 
in  length.  Examples  of  FourDix  words  are  "the",  "a",  "then",  "or",  "he",  "she",  and  so 
on.  The  FourDix  is  a  separate  CST.  Words  of  the  user's  input  text  of  four  or  less 
letters  in  length  are  first  sought  for  in  the  FourDix.  This  is  an  economy  measure,  since 
a  large  portion  of  any  text  is  made  up  of  the  function  words  found  in  the  FourDix. 
Another  savings  of  the  FourDix  is  that  the  concordance  is  created  by  an  inorder 
traversal  of  the  main  CST.  This  means  that  the  functional  words  of  the  FourDix  are 
included  in  the  concordance  only  in  the  phrases  embracing  the  more  significant  words. 
Otherwise,  the  concordance  would  have  dozens  or  perhaps  hundreds  of  lines  showing 
each  use  of  words  like  "the".  The  first  fine-tuning  change  suggested  by  Style8  dealt 
with  the  values  of  the  words  of  the  FourDix.  These  words  had  values,  of  course,  but 
they  tended  to  obscure  the  overall  picture.  Eliminating  their  values  from  the  tally  was 
the  first  solution.  I  modified  this  to  adding  only  those  values  of  personal  pronouns, 
which  are  words  of  energetic  vigor.  This  is  similar  to  the  approach  of  PC-Style.  The 
second  fine-tuning  change  was  not  adding  the  length  of  the  FourDix  words  to  the  tally 
of  word  lengths.   With  the  FourDix  words,  a  bar  graph  of  word  length  usually  had  two 
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humps  :  one  tall  one  around  three  letters  and  another,  shorter  one  near  five  letters.  By 
eliminating  the  length  of  the  FourDix  words,  both  fiction  and  nonfiction  writing  always 
showed  a  bell-shaped  curve.  The  bell  of  fiction  writing  tends  to  be  a  tall  one  centered 
on  five  letters/word;  the  bell  of  nonfiction  writing  tends  to  be  a  shorter  one  centered  on 
six  letters/word.  All  this  means  is  that  fiction  writers  use  shorter  words  and  words  of 
more  uniform  length,  whereas  nonfiction  writers  use  longer  words  and  words  of  more 
variable  length.  As  a  stylistic  measure,  its  use  is  limited.  As  a  bar  graph,  it  is  a  lot  of 
fun. 

In  any  case,  the  profiles  obtained  allowed  me  to  examine  the  true  usefulness  of 
The  Stylist.  Thankfully,  The  Stylist  was  able  to  distinguish  between  the  novels  of 
Ernest  Hemingway  and  the  ramblings  of  computer  science  graduates.  The  profiles  of 
each  text  corresponded  closely  to  my  subjective  impressions.  (This  is  not  surprising 
since  the  values  of  the  dictionary  were  also  the  product  of  my  subjective  decisions.  In 
this  sense,  The  Stylist  is  an  expert  system  that  has  automated  my  own  rules  and  sense 
of  style.  The  facility  of  allowing  the  dictionary  to  change,  however,  allows  any  user  to 
adapt  The  Stylist  to  his  own  taste). 

I  now  could  finish  coding  Reporter.  Reporter  now  took,  the  profile  and 
manipulated  its  values  to  create  an  Analysis.  This  manipulation  took  into  account  the 
genre  of  the  user's  input  text  and  involved  simple  weighting  factors  rather  than  any 
complex  statistical  methods.  Like  the  original  values  of  the  words  of  the  dictionary, 
these  weighting  factors  depended  upon  my  own  subjective  impressions,  but  they  also 
derived  from  the  results  of  the  test  data  set  runs.  If  a  technical  paper  had  four  times  as 
many  intangible  words  than  tangible  words,  it  read  like  so  much  mush.  So  I 
incorporated  a  ratio  of  4:1  as  the  limit  of  intangible:tangible  for  nonfiction.  I  tried  to 
be  conservative  in  these  weightings,  because  I  always  envisioned  The  Stylist  as  a 
descriptive  rather  than  a  prescriptive  tool.  The  Stylist  thus  rebukes  the  user  only  when 
the  qualities  of  his  text  fall  far  outside  the  bounds  of  the  norm. 

B.       FIELD  TESTS 

The  Stylist  was  finally  ready  for  field  testing.  This  final  phase  was  meant  to 
mimic  the  introduction  of  the  software  product  for  consumer  use.  The  field  testing 
took  place  in  two  parts.  In  the  first,  two  computer  science  graduates  were  briefed  on 
the  program  and  asked  to  use  it.  In  the  second,  22  papers  for  an  Administrative 
Science  composition  course  were  run  though  The  Stylist. 


30 


1.  Field  Test  One  :  The  Ape  Test 

The  two  computer  science  graduate  students  evaluated  the  program  by 
unleashing  it  on  some  of  their  own  writing.  Their  overall  reaction  was  highly  positive, 
possibly  because  The  Stylist  seemed  to  approve  of  their  writing.  They  praised  the 
human  factors  and  the  graphics.  They  thought  that  the  breakdowns  of  the  values  and 
the  concordances  were  intriguing  and  offered  them  insights  into  their  own  writing. 
They  complained  that  the  report  didn't  explain  the  meanings  of  some  of  the  values. 
They  also  didn't  like  the  need  to  remove  embedded  commands  from  the  input  text.  I 
addressed  these  complaints  by  adding  some  explanations  to  the  report  and  by  changing 
Reader  so  that  it  didn't  signal  End_of_Sentence  when  it  encountered  an  embedded 
command. 

2.  Field  Test  Two  :  The  Writing  Class 

A  Naval  Postgraduate  School  Administrative  Science  composition  class 
provided  22  short  papers  for  testing.  These  students  are  almost  all  military  officers 
who  are  considered  by  their  services  as  top  performers.  As  such,  they  constitute  a 
somewhat  literate  and  professional  test  group.  Their  fairly  uniform  papers  were 
apparent  attempts  to  incorporate  some  of  the  lessons  of  good  business  communication. 
I  ran  their  papers  through  The  Stylist,  then  provided  the  reports  to  the  Professor,  who 
gave  them  to  the  students.  I  never  met  the  students;  neither  did  they  ever  use  The 
Stylist  themselves.  Of  the  22  students,  12  filled  out  a  questionnaire  detailing  their 
reactions  to  the  reports. 

Most  (10  out  of  12)  had  never  heard  of  a  style  checker  before.  Most  (11  out 
of  12)  understood  the  reports.  One  student  thought  The  Stylist  was  worthless:  11 
found  some  of  its  features  helpful.  Six  students  found  the  measures  of  word  length  and 
sentence  length  helpful.  Nine  found  the  measures  of  vigor,  etymology  and  difficulty 
helpful.  Six  liked  the  concordance;  four  didn't  think  it  was  useful.  Interestingly,  only 
five  thought  The  Stylist  accurately  reflected  the  qualities  of  their  writing;  three  were 
sure  it  didn't;  three  just  weren't  sure.  One  common  complaint  was  that  they  weren't 
sure  of  the  meanings  of  some  of  the  categories. 

Overall,  this  test  underscored  the  need  for  a  user's  manual.  Appendix  B  is  just 
such  a  manual.  If  this  manual  had  been  available  for  the  students  or  if  they  had  had 
more  experience  using  The  Stylist,  then  they  would  have  better  understood  the 
meaning  of  the  reports.  Despite  this  drawback,  it  was  obvious  from  the  comments  in 
the  questionnaire  that  the  majority  of  the  students  liked  The  Stylist  and  would  want  to 
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use  it  or  similar  programs  to  analyze  their  writing.  This  test,  therefore,  indicated  that 
The  Stylist  would  be  able  to  find  a  place  in  the  classroom  and  possibly  even  the 
market. 
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VI.  CONCLUSIONS 

Researching,  designing,  coding  and  testing  The  Stylist  was  an  excellent  academic 
exercise.  It  brought  home  many  of  the  lessons  of  software  engineering.  It  sharpened 
my  skills.  The  final  product,  The  Stylist,  seems  a  success  in  that  its  affective  modeling 
works.  As  such,  The  Stylist  should  serve  as  a  contribution  to  the  development  of  other 
style  checkers,  rather  than  a  stand-alone  style  checker  itself.  As  to  whether  writers, 
teachers  and  students  should  use  computers  to  analyze  writing,  I'm  reminded  of  the 
following  passage  from  Plato's   "Phaedrus": 

Socrates  : 

At  the  Egyptian  city  of  Xaucratis,  there  was  a  famous  old  god,  whose  name 
was  Theuth  ...  his  great  discovery  was  the  use  of  letters.  Now  is  those  days  the 
god  Thamus  was  the  king  of  the  whole  country  of  Egypt  ...  To  him  came 
Theuth  and  showed  his  inventions  .  .  .  when  they  came  to  letters,  This,  said 
Theuth,  will  make  the  Egyptians  wiser  and  give  them  better  memories;  it  is  a 
specific  both  for  the  memory  and  for  the  wit.  Thamus  replied:  O  most  ingenious 
Theuth,  the  parent  or  inventor  of  an  art  is  not  always  the  best  judge  of  the  utility 
or  inutility  of  his  own  inventions  to  the  users  of  them.  And  in  this  instance,  you 
who  are  the  father  of  letters,  from  the  paternal  love  of  your  own  children  have 
been  led  to  attribute  to  them  a  quality  which  they  cannot  have;  for  this 
discovery  of  yours  will  create  forgetfulness  in  the  learners'  souls,  because  they 
will  not  use  their  memories;  they  will  trust  to  the  external  written  characters  and 
not  remember  of  themselves.  The  specific  which  you  have  discovered  is  not  an 
aid  to  memory,  but  to  reminiscence,  and  you  give  your  disciples  not  truth,  but 
only  the  semblance  of  truth;  they  will  be  hearers  of  many  things  and  will  have 
learned  nothing;  they  will  appear  to  be  omniscient  and  will  generally  know 
nothing;  they  will  be  tiresome  company,  having  the  show  of  wisdom  without  the 
reality.   [Ref.  18] 

In  the  fourth  century  before  Christ,  wise  Athenians  were  debating  the  uses  and 
evils  of  literature  itself.  As  Plato  points  out  in  the  above  passage  toward  the  end  of  his 
dialogue,  literature,  the  act  of  writing  down  our  ideas,  can  have  its  pitfalls.  It  can 
weaken  our  memories.  (Let  anyone  who  has  never  forgotten  where  he  parked  his  car 
argue  this  point.)  It  can  also  lead  to  intellectual  cheating.  The  wisdom  of  a  man 
capable  of  piecing  together  an  article  with  the  aid  of  a  library  and  a  long  weekend  may 
indeed  be  less  than  the  wisdom  of  a  man  who  can  stand  before  a  learned  crowd  and 
speak  an  intelligent  discourse. 
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Nevertheless,  civilization  has  embarked  on  a  course  inseparable  from  literature 
and  writing.  Wealth,  power  and  knowledge  has  followed  the  progress  of  the  written 
word.  Literature  has  made  us  strong,  but  to  the  extent  that  we  rely  upon  books 
instead  of  our  minds,  we  have  grown  weak.  Now,  at  what  we  call  the  dawn  of  the 
information  age,  we  would  do  well  to  remember  the  reservations  of  Plato.  Electronic 
computation  offers  us  dazzling  abilities,  but  to  the  extent  we  rely  upon  it  instead  of  our 
minds,  we  will  grow  weak. 

Therefore,  any  style  checker  or  CACI  program  should  be  used  as  an  interesting 
tool  that  provides  a  fresh  perspective  on  writing.  These  programs  cannot  take  and 
should  not  be  put  in  the  place  of  thoughtful  readers,  editors,  teachers  and  friends. 
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APPENDIX  A 
SUGGESTED  EXTENSIONS  TO  THE  STYLIST 

1.  ABBREVIATIONS 

Reader's  Read_Intext  procedure  ignores  embedded  commands  such  as 
".embedded"  by  keeping  count  of  the  number  of  words  read  since  the  end  of  the  last 
sentence.  If  only  one  word  has  been  read  when  another  period  is  encountered, 
Read_Intext  does  not  signal  end_of_sentence.  This  simple  feature  also  allows 
Read_Intext  to  ignore  the  ellipsis  ("...").  It  doesn't  always  allow  it  to  ignore 
abbreviations  such  as  "Mr.",  "Dr.",  "etc.",  or  "e.g.".  If  these  abbreviations  appear  as 
the  first  word  of  this  sentence,  Read_Intext  doesn't  signal  end_of_sentence.  If  the 
abbreviation  appears  somewhere  in  the  middle  of  the  sentence,  Read_Intext  does  signal 
end_of_sentence.  An  example  is  the  following  sentence  :  "You  should  have  told  Prof. 
Wu  about  this  earlier."  Read_Intext  would  signal  end_of_sentence  at  "Prof."  and 
"earlier.".  This  is  a  shortfall  of  the  Reader  module,  but  one  which  I  haven't  corrected 
for  four  reasons:  first,  such  occurrences  are  relatively  rare;  second,  breaking  the 
sentences  into  two  always  favors  the  user;  third,  the  length  of  sentence  measure  is  not 
the  thrust  of  The  Stylist;  fourth,  the  solution  would  involve  unwanted  overhead  in 
computation. 

Writer's  Workbench  solves  this  problem  for  at  least  48  abbreviations. 
Presumably,  every  time  a  period  is  encountered,  Writer's  Workbench  checks  a 
dictionary  of  these  48  abbreviations.  If  the  word  preceding  the  period  matches  one  of 
these  abbreviations,  then  Writer's  Workbench  does  not  count  this  as  an  end  to  a 
sentence.  A  similar  solution  could  be  implemented  for  The  Stylist. 

2.  TRANSLATION  TO  TURBO  PASCAL 

The  design  of  The  Stylist  kept  in  mind  the  working  environment  of  a  personal 
computer  with  1.5  megabytes  of  main  storage.  A  good  follow-on  project  for  The 
Stylist  would  be  to  translate  the  code  from  Waterloo  Pascal  to  Turbo  Pascal.  Copies 
of  this  program  could  then  be  distributed  and  possibly  even  marketed. 

3.  IMPROVEMENTS  OF  THE  CONCORDANCE 

In  addition  to  the  main  CST  and  the  FourDix,  a  third  CST  could  be  added.  This 
new   CST's   node's   key   values   would   be   characters    of  punctuation.     During   the 
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execution  of  Inorder_Concordance,  the  procedure  that  creates  the  concordance,  the 
traversals  "lastward"  and  "nextward"  could  terminate  upon  encountering  a  character  of 
punctuation  in  this  third  CST.  This  would  mean  the  lines  of  the  concordance  would 
only  contain  phrases  extracted  from  single  sentences.  This  may  or  may  not  be  a 
desirable  feature.  I  myself  found  it  useful  to  read  entire  phrases,  even  when  they 
overlapped  into  preceding  or  succeeding  sentences. 

More  sophisticated  concordances  include  at  the  end  of  each  line  a  note  as  to 
where  this  line  can  be  found.  The  third  CST  outlined  above  could  contains  as  a  field  in 
its  main  node  just  such  a  note,  with  the  name  of  the  user's  input  text,  the  page  number 
and  the  line  number.  Such  an  extension  would  be  valuable  for  scholarly  research.  For 
the  short,  single  input  texts,  such  notes  are  not  necessary. 

4.  PARTS  OF  SPEECH 

A  considerably  more  ambitious  extension  would  be  to  provide  The  Stylist  the 
facility  of  determining  the  parts  of  speech.  Other  programs  [Ref.  8]  [Ref.  9]  [Ref.  10] 
solving  this  problem  use  thousands  of  rules.  The  disambiguation  of  parts  of  speech 
would  raise  The  Stylist  to  a  whole  new  order  of  complexity.  It  would  also  allow  more 
fine  analysis  of  the  characteristics  of  the  words.  For  example,  "like"  as  a  verb  has 
much  different  connotations  than  "like"  as  a  preposition.  This  facility  would  also 
allow  other  stylistic  measures  to  figure  into  the  profiles,  such  as,  verb-adjective  ratio 
and  percentage  of  modifiers. 

5.  PASSIVE  VOICE 

The  Researcher  module  could  be  extended  to  detect  the  use  of  passive  voice. 
Ideally,  this  would  be  accomplished  in  the  context  of  the  extension  involving  parts  of 
speech,  so  that  active  verbs  would  also  be  detected.  A  simpler  solution  is  readily 
available,  however.  Researcher  could  raise  a  flag  every  time  it  receives  a  form  of  the 
verb,  "to  be":  that  is,  "is",  "are",  "am",  "was",  "were",  "being",  and  "been".  This  flag 
would  remain  raised  for  the  next  three  words.  If  any  of  these  three  words  ended  in 
"-ed",  then  Researcher  would  count  this  as  an  example  of  the  use  of  the  passive  voice. 
Again,  I  haven't  implemented  this  solution  because  it  is  not  central  to  the  concept  of 
affective  modeling  and  because  I  didn't  want  to  pay  for  the  extra  computation. 

6.  CORRECTION  OF  POOR  DICTION  AND  GRAMMAR 

Another  ambitious  extension  of  The  Stylist  would  be  to  provide  it  with  the 
facility  of  correcting  poor  diction  and  grammar.    Both  RightWriter  and  DICTION  of 
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Writer's  Workbench  contain  dictionaries  of  commonly  used  cliches  or  repetitive 
redundancies.  Some  examples  from  DICTION  are  :  "a  great  deal  of,  "in  regards  to", 
"make  adjustments  to"  and  so  on.  One  solution  for  The  Stylist  would  be  to  include  a 
new  field  in  the  dictionary.  Words  like  "deal",  "regards"  and  "adjustments"  would 
contain  in  this  field  a  pointer  to  a  table  containing  the  poor  phrases  and  their 
replacements.  Such  a  solution  would  also  require  Researcher  to  maintain  a  phrase  of 
the  three  or  four  most  recently  received  words,  so  that  it  could  verify  that  "regards" 
appeared  in  the  offending  context  of  "in  regards  to". 

Grammatical  corrections  would  require  an  even  more  ambitious  extension.  To 
perform  properly,  such  a  facility  would  have  to  be  an  extension  of  the  parts  of  speech 
extension.  Determining  agreement  between  subject  and  verb,  for  example,  would 
require  first  the  identification  o[  the  subject  and  the  verb.  Frankly,  it  would  be  easier 
to  incorporate  the  affective  modeling  of  The  Stylist  into  a  more  extensive  program  such 
as  Writer's  Workbench  than  it  would  be  to  extend  The  Stylist  this  far. 
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APPENDIX  B 
USER'S  MANUAL 

1.        INTRODUCTION 

The  Stylist  is  a  Waterloo  Pascal  program  that  analyzes  the  style  of  English  prose, 
both  fiction  and  nonfiction.  The  Stylist  package  contains  the  following  :  this  User's 
manual,  and  the  following  electronic  files  :  the  Waterloo  Pascal  code  of  The  Stylist 
itself,  the  text  file  dictionary  of  The  Stylist,  and  two  Waterloo  Pascal  programs  for 
helping  to  maintain  the  dictionary,  DIXSPLIT  and  DIXJOIN. 

The  fundamental  idea  of  The  Stylist  is  that  individual  words  have  power.  The 
denotation  of  a  word  is  its  meaning.  The  connotation  of  a  word  is  its  emotional 
impact.  For  example,  "pupil",  "student",and  "scholar"  all  denote  a  person  who  studies 
and  learns.  The  emotional  connotations,  however,  range  from  the  humble  "pupil" 
through  the  familiar  "student"  to  the  lofty  "scholar".  Besides  these  emotional 
connotations,  words  have  other  intrinsic  values  which  can  be  quantified.  The  Stylist 
concentrates  on  the  following  values:  Etymology,  Tangibility,  Difficulty,  Emotional 
Connotation  and  Vigor. 

a.  Etymology 

One  of  the  beauties  of  the  English  language  is  that  its  vocabulary  embraces 
two  main  sources  :  native  and  borrowed.  Our  native  words  mainly  come  to  us  from 
the  Anglo-Saxon  tongues.  Borrowed  words  come  from  Latin,  mostly  by  way  of  the 
French  of  the  Norman  conquest.    See  Table  7,  which  illustrates  the  difference. 


TABLE  7 

EXAMPLES  OF  ETYMOLOGY 

Native  Word 

s 

Borrowed  Words 

Man 

Person 

Finger 

Digit 

Thinker 

Philosopher 

Fire 

Conflagration 

Book 

Volume 

Dirtbag 

Miscreant 
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Native  words  tend  to  be  short,  strong  and  rough.  Borrowed  words  tend  to  be 
long,  mild  and  elegant.  Good  English  prose,  particularly  good  fiction,  tends  to  use 
native  words.  Poor  English  prose,  particularly  bad  technical  writing,  exhibits  a 
tendency  to  overutilize  latinate  etymology.  By  counting  up  the  number  of  native  and 
borrowed  words  in  your  input  text,  The  Stylist  sees  how  it  compares  to  good  writing  of 
the  appropriate  genre, 
b.  Tangibility 

A  word  can  either  evoke  an  image  of  a  thing  or  it  can  refer  to  an  idea.  The 
former  words  are  "tangible"  and  the  latter  are  "intangible".  Table  8  illustrates  the 
difference. 


TABLE  8 
EXAMPLES  OF  TANGIBILTY 


TANGIBLE  INTANGIBLE 

Rock  Ethereal 

Lips  Automatically 

Beehive  Then 

Corvette  Rely 

Hammer  Preliminary 


Tangible  words  are  concrete,  exact  and  evocative.  Intangible  words  are  ideal, 
general  and  cognitive.  All  writing  uses  both.  Good  writing,  however,  usually  takes 
advantage  of  tangible  words.  Even  the  most  philosophical  writing  benefits  from  the 
use  of  tangible  words.  Good  fiction  rarely  strays  too  far  toward  the  intangible.  The 
Stylist  counts  the  tangible  and  intangible  words  of  your  input  text  and  compare  this 
count  to  good  writing  of  the  appropriate  genre, 
c.  Difficulty 

This  is  one  of  the  most  indisputable  characteristics  of  words.  The  Stylist 
defmes  four  levels  of  difficulty  :  Elementary,  High  Schdol,  Graduate  and  Postgraduate. 
The  words  of  each  level  are  those  most  likely  to  be  used  in  speech  with  ease  by  an 
average  person  of  that  educational  level.   Table  9  illustrates  the  categories. 

By  counting  the  occurrences  of  difficult  words,  The  Stylist  could  determine  the 
overall  difficulty  of  the  vocabulary  and  thus  the  readability  of  your  text. 
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TABLE  9 
EXAMPLES  OF  DIFFICULTY 


ELEMENTARY 

Big 

Sister 

Spoon 

Flying 

Wi  sh 

Done 

Handsome 


HIGH  SCHOOL 

Tardy 

Rendezvous 

Transmission 

Foreman 

Process 

Joyous 

Undergo 


GRADUATE 

Matrix 

Immaterial 

Tonality 

Universal 

Induction 

Processor 

Linear 


POSTGRADUATE 

Execrable 

Parsimony 

Recursive 

Homomorph 

Dilatory 

Zygote 

Synergy 


d.  Emotional  Connotations 

The    Stylist    defines    five    categories    of  emotional    connotation    :     Sublime, 
Pleasant,  Neutral,  Unpleasant  and  Horrid.   Table  10  illustrates  the  categories. 


TABLE  10 

EXAMPLES  OF  EMOTIONAL  CONNOTATION 

SUBLIME 

PLEASANT 

NEUTRAL 

UNPLEASANT 

HORRID 

Beauty 

Happy 

The 

Damage 

Cancer 

Sunrise 

Food 

Which 

Loss 

Murder 

Victory 

Friendly 

Brick 

Insulting 

Whore 

Love 

Warm 

Is 

Loser 

Fucks 

God 

Helpful 

Name 

Cost 

Death 

Paradise 

Sex 

When 

Wound 

Traitor 

By  counting  the  use  of  these  types  of  words,  The  Stylist  could  determine  the  overall 
emotional  tone  of  your  passage.    It  could  also  detect  flat  or  highly  emotional  writing, 
e.  Vigor 

A  related  but  distinct  characteristic  is  vigor.  Sublime  and  horrid  words  tend  to 
be  highly  vigorous,  but  not  all  highly  vigorous  words  are  emotional.  Examples  of 
vigorous  but  unemotional  words  are  "sprint",  "rush"  and  "cross".  This  category  also 
tends  to  be  more  objective.  For  example,  the  word,  "soldier"  can  have  widely  different 
connotations  for  different  people.  The  word  "soldier"  would  please  a  career  Army 
officer  but  it  would  displease  a  survivor  of  the  Japanese  occupation  of  Canton.  Both 
could  agree  that  "solider"  is  a  vigorous  word. 
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TABLE  11 

EXAMPLES  OF  VIGOR 

VIOLENT 

ENERGETIC 

CALM 

INERT 

Destroy 

Sprint 

Read 

From 

Creation 

Dive 

Write 

Into 

Fire 

Discipline 

Manager 

Something 

Atomic 

Wedding 

Ocean 

Comma 

Holocaust 

Steam 

Blue 

Paper 

Conqueror 

Flying 

Sought 

Format 

By  calculating  the  vigor  the  words  of  your  input  text,  The  Stylist  would  be 
able  to  estimate  its  overall  strength. 


2.        USING  THE  STYLIST 

The  first  step  to  using  the  Stylist  is  to  save  your  piece  of  writing  as  "Input  text 
a".    You  should  be  aware  of  the  following  principles: 

1.  Your  input  text  should  be  at  least  500  words  long.  The  Stylist  will  examine 
shorter  texts,  but  the  statistical  sample  of  shorter  texts  is  too  small  for  valid 
analysis. 

2.  Your  input  text  should  not  be  longer  than  500  sentences  (about  10.000  words). 
Such  long  texts  require  so  much  computation  that  most  operating  systems  such 
as  MVS  will  terminate  execution  before  completion. 

3.  The  Stylist  is  not  meant  to  analyze  non-prose  constructions  such  as  tables,  lists, 
references  and  bibliographies.   You  should  eliminate  these  from  your  input  text. 

In  the  multiprocessing  environment,  DEFINE  MAIN  STORAGE  at  1500k.  The 
VM  CMS  command  is  "Define  storage  1500k"  followed  by  "I  CMS". 

Once  your  input  text  is  properly  stored,  execute  The  Stylist.  DO  NOT  interrupt 
The  Stylist  during  compilation  or  execution  by  hitting  any  keys.  Always  wait  for  The 
Stylist  to  display  a  screen  with  a  prompt.  After  The  Stylist  is  compiled,  you  will  see 
just  such  a  series  of  screens  that  will  ask  you  for  information.  The  following 
discussions  will  help  you  decide  how  to  answer. 

a.  Name  of  Intext 

The  first  screen  will  ask  you  for  the  name  of  your  input  text.  Simply  type  in 
the  name.  You  can  use  any  characters.  The  maximum  length  of  the  name  is  40 
characters.   Hit  "enter". 
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b.  Fiction  or  NonFiction 

The  screen  will  automatically  clear.  A  new  screen  will  ask  you  if  your  input 
text  is  fiction  or  nonfiction.  Enter  "1"  or  "2".  If  you  enter  any  other  character,  The 
Stylist  will  merely  ask  you  again. 

c.  Report  Frequency 

The  next  screen  asks  whether  you  want  reports  "frequently",  "seldomly"  or 
"when  done".  A  report  is  a  screen  which  The  Stylist  displays  to  you  during  execution, 
telling  you  what  it's  doing,  how  much  it's  done  and  how  much  remains  to  be  done. 
The  act  itself  of  producing  such  takes  time  :  the  more  frequent  the  reports,  the  longer  it 
takes.  You  should  probably  enter  "2"  for  "Seldomly".  Most  users  find  this  the  most 
agreeable. 

d.  Concordance 

The  next  screen  asks  whether  or  not  you  want  a  concordance.  A  concordance 
is  an  alphabetical  listing  of  all  the  significant  words  of  your  input  text.  ("Significant" 
words  are  all  words  other  than  functional  words  such  as  "the",  "and",  "as",  and  so  on.) 
Each  line  of  the  concordance  contains  the  key  word  and  the  phrases  preceding  and 
succeeding  it.    The  following  is  an  example  of  a  concordance. 

them  and  thamus  enquired      ABOUT     their  several  uses  and 

discovered  is  not  an      AID     to  memory  but  to 

other  egyptians  might  be      ALLOWED     to  have  the  benefit 

an  art  is  not      ALWAYS     the  best  judge  of 

is  called  by  them     A  MM  ON     to  him  came  theuth 

learned  nothing  they  will      APPEAR    to  be  omniscient  and 

censured  others  as  he      APPROVED     or  disapproved  of  them 

many  arts  such  as      ARITHMETIC    and  calculation  and  geometry 

or  inventor  of  an      ART    is  not  always  the 

the  inventor  of  many     ARTS     such  as  arithmetic  and 

blame  of  the  various      ARTS     but  when  they  came 

calculation  and  geometry  and      ASTRONOMY    and  draughts  and  dice 

have  been  led  to      ATTRIBUTE     to  them  a  quality 

in  the  learners'  souls      BECAUSE    they  will  not  use 

vour  own  children  have      BEEN     led  to  attribute  to 
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Most  users  find  that  a  concordance  offers  them  interesting  insights  into  their 
use  of  words.  The  only  drawback  of  producing  a  concordance  is  that  it  takes  The 
Stylist  additional  time.  If  you  elect  this  option,  your  concordance  will  be  printed  to  a 
separate  file  called,  "Concrdnc  text  a". 

e.  Number  of  Words  in  Intext 

The  next  screen  asks  you  for  the  number  of  words  in  your  text.  If  you  don't 
know,  make  any  guess.  This  number  is  only  used  in  the  screens  displaying  how  far 
along  The  Stylist  is  in  reading  the  text.  Your  guess  will  have  no  effect  on  the  analysis 
of  the  text.  The  Stylist  counts  the  words  itself  and  will  report  the  exact  number  to  you 
when  it  finishes. 

f.  Expand  the  Dictionary 

The  next  screen  will  ask  whether  you  want  to  expand  the  dictionary  of  The 
Stylist  or  not.  This  dictionary  is  a  listing  of  about  3200  of  the  most  commonly  used 
words,  together  with  five  values  for  each  word.  If  you  answer  "1"  for  "Yes",  the  Stylist 
will  ask  you  to  give  it  a  values  for  every  word  in  you  input  text  which  is  not  found  in 
the  dictionary.  It  will  then  add  these  new  words  and  their  values  to  the  dictionary.  If 
you  answer  "2"  for  "No",  then  The  Stylist  will  write  all  of  the  unknown  words  to  a  file 
called  "Newwords  text  a".  Reviewing  "Newwords"  will  give  you  an  idea  of  the  words 
that  you  use  which  aren't  in  the  dictionary. 

The  first  few  times  that  you  use  The  Stylist,  you  probably  should  answer  "2" 
for  "No".  Entering  the  values  for  the  new  words  can  be  tedious.  You  probably  want 
to  see  how  The  Stylist  works  before  you  begin  to  expand  or  modify  the  dictionary. 
After  you've  used  The  Stylist  for  a  while,  you'll  probably  want  to  begin  adding  some 
words  to  the  dictionary.  Words  peculiar  to  your  field  of  writing  or  the  jargon  of  your 
profession  are  examples  of  the  types  of  words  which  should  be  entered.  Adding  them 
will  tailor  The  Stylist  to  your  working  environment.    Keep  the  following  in  mind  : 

***  Every  word  added  to  the  dictionary  is  another  word 
that  will  have  to  be  loaded  into  memory  during  each 
execution  of  The  Stylist.   Adding  many  words  will  slow 
down  execution  and  it  will  decrease  the  amount  of  space 
in  main  storage.   If  the  dictionary  grows  beyond  5000 
words,  you  should  weed  out  words  that  are  rarely  used 
in  your  type  of  writing.    To  do  this,  simply  delete 
that  line  from  the  file  "Dixonarv  text  a".    *** 
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If  you  answer  "\"  for  "Yes",  as  the  Stylist  encounters  each  new  word  in  your 
text,  it  will  ask  if  you  want  to  add  this  particular  word  to  the  dictionary.  If  the  word  is 
a  misspelling  or  if  it  is  a  word  that  you  rarely  use,  don't  add  it.  Also,  you  should  know 
that  its  best  to  add  the  root  form  of  words.  For  example,  neither  "burns"  nor  "burned" 
nor  "burning"  should  be  added.  It's  much  better  to  add  "burn".  To  keep  the 
dictionary  as  compact  as  possible,  note  down  the  root  form  of  words  you  wish  to  add 
to  the  dictionary  and  append  them  to  the  end  of  the  next  text  you  run  through  The 
Stylist. 

The  Stylist  will  then  ask  you  for  the  values  of  the  words  you  do  want  to  add 
to  the  dictionary.   The  following  will  help  you  decide  how  to  answer  these  prompts  : 

/.   Source 

If  you're  uncertain  about  the  etymology  of  the  word,  simply  look  it  up  in  a 
collegiate  dictionary.  Old  English,  Middle  English,  Norse,  German,  Old  German, 
Middle  German  and  all  Celtic  languages  are  considered  "Germanic".  Latin,  French, 
Spanish,  and  all  other  languages  are  considered  "Latin". 

2.    Difficulty 

Each  category  contains  those  words  which  are  most  likely  to  be  used  in 
speech  by  persons  of  that  educational  level.  If  you're  still  uncertain  of  the  meanmg  of 
these  categories,  the  following  are  examples  of  the  categories  : 

POSTGRADUATE  DIFFICULTY 

aggregate 

algorithm 

algorithmic 

ambiguity 

analogue 

applicative 

conceptualize 

conglomerate 

constriction 

consummate 

consummation 

convoluted 

convolution 
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culminate 

cyclical 

digitize 

GRADUATE  DIFFICULTY 

accumulate 

acrid 

acute 

adhere 

adjoin 

allocate 

allude 

analogy 

analysis 

analyst 

ancestor 

ancestral 

annihilate 

apparatus 

appendage 

arc 

arch 

array 

HIGH  SCHOOL  DIFFICULTY 

abandon 

ability 

able 

abrupt 

absence 

absent 

absorb 

acceptable 

access 
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accommodate 

accomplish 

accomplishment 

account 

accurate 

achievement 

activate 

adapt 

additional 

adjust 

advantage 

ELEMENTARY  DIFFICULTY 

about 

above 

accept 

acceptance 

ache 

across 

act 

action 

active 

activity 

actor 

add 

addition 

address 

admit 

advance 

adventure 

afraid 

after 

afternoon 

asain 
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3.  Concreteness 

Does  this  word  name  a  thing  that  you  can  touch?    If  it  does,  then  answer 
"1"  for  "Tangible".   Otherwise,  answer  "2". 

4.  Emotional  Connotation 

What  feelings  does  this  word  evoke  in  you?    If  none,  answer  "3"  for  neutral. 
Otherwise,  answer  in  the  appropriate  category. 

5.  Vigor 

Vigorous  words  are  not  necessarily  verbs.  Words  like  "conquest"  are  highly 
vigorous.  "Violent"  vigor  means  "highly"  vigorous.  Some  of  the  words  in  the  "violent" 
category  are  "creation",  "triumph"  and  "epiphany".  This  category,  as  does  emotional 
connotation,  calls  for  your  subjective  reaction.  Don't  be  afraid  to  "guess"  which  value 
is  appropriate.  Such  a  "guess"  is  in  fact  your  subjective  reaction, 
g.  The  End  of  Execution 

The  above  are  all  the  screens  which  prompt  you  for  answers.  All  the  rest  of 
the  screens  will  merely  inform  you  about  the  progress  of  the  execution.  When 
execution  ends,  you  should  look  into  the  following  files  :  "Newwords  text  a",  if  you 
decided  not  to  let  the  dictionary  expand;  "Concrdnc  text  a",  if  you  elected  to  have  a 
concordance  produced;  and  most  importantly,  "Report  text  a",  which  is  The  Stylist's 
report  on  your  text. 

3.       THE  MEANING  OF  YOUR  REPORT 

a.  Measures  of  Length 

The  first  two  pages  of  your  report  deals  with  the  number  of  words,  the  length 
of  words  and  the  length  of  sentences.  These  measures  are  always  interesting  but  not 
always  illuminating.  The  first  bar  graph  shows  the  distribution  of  the  length  of  the 
words  of  your  text.  Each  horizontal  column  represents  the  percentage  of  the  words  of 
your  next  which  were  of  so  many  letters  of  length.  The  overall  bar  graph  should  look 
like  a  bell-shaped  curve.  This  bar  graph  does  not  include  function  words  such  as  "a", 
"an",  "the",  "then"  and  "or".  The  blurb  below  the  bar  graph  explains  how  your  text 
compares  to  other  texts  of  the  same  genre.  The  only  clear  indication  of  prose  in 
trouble  would  be  a  dramatic  slewing  of  the  curve  to  the  right.  This  would  mean  that 
you're  using  too  many  windy  words.  Table  12  breaks  down  the  characterizations  of 
word  length. 
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TABLE  12 

WORD  LENGTH 

Characterization 

Average  Word  Length 

( lower  boundary) 
Fiction   Nonficnon 

Too  Long 
Long 
Medium 
Short 
Too  Short 

6          7 
5.5         6.5 
5           5.  5 
4          5 

0           0 

The  second  bar  graph  shows  the  length  of  the  sentences  of  your  input  text. 
Each  bar  represents  the  length  of  a  sentence.  You  can  use  this  graph  to  identify  run- 
on  sentences.  Overall,  this  bar  graph  should  resemble  the  skyline  of  a  modest  city  such 
as  San  Francisco.  If  it  resembles  New  York,  then  you  tend  to  write  long  sentences. 
Table  13  explains  the  characterizations  of  sentence  length. 


TABLE  13 
SENTENCE  LENGTH 


Characterization 

Too  Long 
Long 
Medium 
Short 
Too  Short 


Lower  boundary  of  length 

22 

18 

15 

10 

0 


One  of  the  measures  below  this  bar  graph  is  "Number  of  sweet  spots".  This  is 
the  number  of  sentences  ranging  between  9  and  19  words.  The  idea  here  is  that  most 
ideas  can  find  a  home  in  a  sentence  of  this  length.  Table  14  breaks  down  how  The 
Stylist  characterizes  the  modulation  of  your  sentences. 

Another  count  here  is  the  number  of  run-on  sentences.  The  Stylist  defines 
any  sentence  over  45  words  in  length  as  a  run-on.  The  Stylist  realizes  that  many  grand 
sentences  over  45  words  in  length  are  not  run-ons,  but  it's  willing  to  bet  that  most 
overlone  sentences  are. 
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TABLE  14 
MODULATION 


Percentage  Characterization 

Above    50%  Good 

20%   to    50%  Average 

Below   20%  Poor 


b.  Measures  of  Word  Characteristics 
/.    Etymology 

The  next  item  in  the  report  is  the  number  of  Latinate  and  Germanic  words. 
Notice  that  these  two  numbers  will  not  add  up  to  the  total  number  of  words  in  the  text 
unless  even'  word  in  the  text  happened  to  belong  to  the  dictionary.  The 
characterization  of  your  text  in  this  category  depends  on  the  ratio  of  latin  to  germanic 
words.  Good  writing  generally  tends  to  be  native.  Table  15  breaks  down  these 
characterizations. 


TABLE  15 

CHARACTERIZATIONS  OF  ETYMOLOGY 

Characterization 

Ratio  of  Latin  to  German 
Fiction   NonFiction 

Very  Borrowed 

Borrowed 

Mixed 

Native 

Very  Native 

2:  1       3:1 
1.  5:  1       2:  1 
1:1        1:1 
.  5:  1        .  9:  1 
<.  5:  1      <.  9:  1 

2.   Difficulty 

The  next  entry  shows  the  tally  of  the  numbers  of  words  of  the  various 
levels  of  difficulty,  followed  by  the  percentage  of  those  levels.  The  difficulty  of  the 
text's  vocabulary  should  be  appropriate  for  its  intended  audience.  Moreover,  you 
should  remember  than  many  outstanding  works  of  literature  convey  difficult  ideas 
without  resorting  to  difficult  words.  Table  16  describes  the  characterizations  of  this 
category. 
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TABLE  16 

i 

CHARACTERIZATIONS  OF  DIFFICULTY 

Characterizat 

ion   Cause 

Very  Hard 
Hard 

Challenging 
Easy 

%postgrad    >  0 
%grad       >  5 
%nighschool  >  10 
above  percentages 
do  not  pertain 

3.    Tangibility 

The  next  entries  show  the  total  of  the  tangible  and  intangible  words.  As  in 
the  etymology  category,  the  characterizations  depend  upon  the  ratio  between  these 
words  and  the  genre.    Generally,  good  writing  is  tangible. 


TABLE  17 
CHARACTERIZATIONS  OF  TANGIBILITY 


Ratio  of  Intangible  to  Tangible 
Characterization      Fiction    Nonfiction 


Very  tangible 

Tangible 

Very  intangible 


>1.  5:  1  >2:  1 

1.  5:  1  2:  1 

3:1  4:  1 


4.    Emotional  Connotation 

The  next  breakdown  is  of  the  number  and  percentages  of  the  words 
belonging  to  the  various  categories  of  emotional  connotation.  Following  these  values 
is  something  called  the  "Index  of  Emotion".  The  following  is  the  formula  used  to 
derive  this  index  : 

Index  =  (Percent  Horrid  times  5)  ■+- 

(Percent  Unpleasant  times  2)  ■+- 
(Percent  Pleasant  times  2)  +• 
(Percent  Sublime  times  5) 
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The  Index  of  Emotion  is  simply  a  number  that  conveys  a  sense  of  the  use 
of  strongly  emotional  words.  The  most  emotional  words,  Horrid  and  Sublime,  are 
weighted  the  most  heavily.  This  Index  allows  The  Stylist  to  compare  your  use  of 
emotional  words  to  the  use  found  in  other  texts.  Table  18  shows  the  thresholds  of 
such  uses. 


TABLE  18 
CHARACTERIZATIONS  OF  EMOTIONALITY 

Characterization        Lower   Boundry   of    Index 

Rich  20 

Average  10 

Poor  0 


Following  this  characterization  is  another  one,  this  time  for  the  overall  tone 
of  your  text.  The  Stylist  compares  the  frequency  of  use  of  positive  words  to  the 
frequency  of  use  of  negative  words.  If  positive  words  prevail,  the  tone  is  positive.  If 
negative  words  prevail,  the  tone  is  negative.  If  the  two  balance  and  the  index  of 
emotion  is  high,  then  the  tone  is  characterized  as  a  balance  of  strong  positive  and 
strong  negative  emotions.  If  the  two  balance  and  the  index  of  emotion  is  low,  then  the 
tone  is  characterized  as  bland. 
5.    Vigor 

The  final  breakdown  is  that  of  the  number  and  percentage  of  words  of  the 
various  categories  of  vigor.  Following  this  is  an  "Index  of  Strength",  which  is  a 
measure  similar  to  the  Index  of  Emotion.  The  following  is  the  formula  used  to  derive 
the  Index  of  Strength  : 

Index  of  Strength  =  (Percent  of  violent  words  times  10)  + 

(Percent  of  energetic  words  times  5)  + 
(Percent  of  calm  words) 
This  Index  allows  the  Stylist  to  compare  your  use  of  vigorous  words  to  uses 
found  in  other  texts.   Table  19  explains  how  The  Stylist  uses  the  index  to  characterize 
your  text.    Most  good  writing  is  strong  or  lively.    Usually  only  action-packed  fiction 
reaches  the  upper  registers  of  "very  strong". 
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TABLE  19 
CHARACTERIZATIONS  OF  VIGOR 


Characterization  Lower  Boundry  of  Index 

Very  Strong  60 

Strong  50 

Lively  20 
Weak  0 


6.    Recommendations 

The  final  section  of  your  report  includes  recommendations  and 
commendations.  The  Stylist  is  meant  to  be  a  descriptive  rather  than  a  prescriptive 
tool.  Therefore,  it  makes  recommendations  only  when  some  aspect  of  your  writing 
seems  well  outside  of  the  bounds  of  the  normal.  For  this  reason,  you  should  consider 
the  recommendations  of  The  Stylist.  They  apply  only  to  what  appear  to  be  extreme 
cases.  On  the  other  hand,  you  should  never  accept  these  recommendations  as  the 
pronouncements  o^  some  oracle.  The  Stylist  is  merely  a  machine.  When  in  doubt, 
trust  your  own  instincts. 

4.        MAINTENANCE  OF  THE  STYLIST 

a.  Care  of  the  Dictionary 

Many  of  the  issues  regarding  the  care  of  the  dictionary  have  already  been 
covered.    Here  are  some  other  helpful  pointers  : 

1.  Always  maintain  more  than  one  copy  of  the  dictionary.  Loss  of  the  dictionary- 
renders  The  Stylist  useless. 

2.  Beware  of  changing  the  values  directly.  You  should  never  do  this  to  the 
dictionary  itself.  While  using  DIXSPLIT,  it  is  necessary  to  go  into  the  split 
files  and  change  the  codes  for  the  words  which  don't  belong  in  that  split  file.  If 
you  make  a  mistake,  DIXJOIN  will  merely  assign  the  most  common  value  for 
that  category. 

b.  Changing  the  Code  of  The  Stylist 

The  Stylist  comes  with  embedded  documentation.  Make  sure  that  you  read 
this  documentation  prior  to  changing  any  of  its  code.  You  should  read  the  full 
master's  thesis  pertaining  to  The  Stylist  prior  to  making  any  substantive  changes. 5 


3See  Master's  Thesis  :    The  Stylist  :  A  Pascal  Program  for  Analyzing  Prose  Style, 
by  Lt.  Thomas  C.  Cool,  USN,  Naval  Postgraduate  School,  Monterey,  California,  1987. 
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TABLE  20 

CODES  OF  THE  DICTIONARY 

COLUMN 

First 

Second 

Third 

Fourth 

Fifth 

Etymology 

Tangibility 

Difficulty 

Emotion 

Vigor 

1( atin) 
g(  erman) 

t( angible ) 
i( ntang.  ) 

e( lement. ) 
h( igh  sc. ) 
g  rad) 
p( ost  gr. ) 

h( orrid] 
u( nple. 
n(  eutr. 
p( leas. 
s(  ubli.  ' 

i( nert) 
c( aim) 
e(  nerg.  ) 
v(  iole.  ) 

Substantive  changes  should  also  bear  in  mind  the  modularization  of  the 
design.  Reader  module  bears  full  responsibility  for  reading  the  user's  input  text.  It 
passes  along  words  (in  lower  case)  and  their  length.  It  also  signals  the  ends  of 
sentences  and  the  end  of  the  file.  Researcher  Module  bears  full  responsibility  for 
maintaining  and  using  the  dictionary,  for  building  up  the  profile  of  the  user's  input 
text,  and  for  building  the  concordance.  Finally,  Reporter  module  accepts  the  profile, 
creates  the  analysis  and  writes  out  the  report. 

Some  fine  tuning  of  The  Stylist  is  readily  available.  The  global  constants  give 
you  the  capability  to  adapt  The  Stylist  to  your  operating  environment.  For  example, 
"Maxsent"  allows  you  to  determine  the  maximum  number  of  sentences  in  the  input 
text.  Moreover,  by  manipulating  the  weighting  factors  in  the  Calculate  procedures  of 
the  Reporter  module,  you  can  adapt  The  Stylist  to  the  norms  of  a  particular  school  of 
writing,  such  as  submissions  to  magazines  as  varied  as  "Fantasy  and  Science  Fiction" 
and  "Foreisn  Affairs." 
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APPENDIX  C 
THE  CODE  OF  THE  STYLIST 


(x    NOTE  «  THE  FOLLOWING  IS  NOT  IN  EXECUTABLE  ORDER. 

IT  IS  IN  THE  ORDER  OF  GREATEST  CONCEPTUAL  CLARITY,  x) 

(X$S60000X) 

(XXXXX*XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX) 
(X  X) 

(x  THE  STYLIST  x) 

(X  X) 

(x  a  Pascal  Program  for  Analyzing  Prose  Style  X) 

(X  by  Lieutenant  Thomas  C.  Cool,  USN  X) 

(X  Submitted  in  partial  fulfillment  of  the  requirements  x) 

(x  for  the  degree  of  x) 

(X  MASTER  OF  SCIENCE  IN  COMPUTER  SCIENCE  x) 

(X  from  the  X) 

(X  NAVAL  POSTGRADUATE  SCHOOL  X) 

(X  June  1987  X) 

(X  X) 
(XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX) 

program  Stylist  (input,  output); 
const 

maxsent  =  500;         (x  limit  of  number  of  sentences  in  intext  X) 

wordlength  =  15; 

linelength  =  40; 

phraseiength  =  4;       (X  length  of  phrase  around  condcordance  linex) 

freqnum  =  50;  (X  the  three  numbers  determine  how  often     X) 

selnum  =  200;  (x  Stylist  reports  to  user  during  execution^) 

whendonenum  =  1000000;  (x  frequently,  seldomly  or  when  done        X) 

type 

(x  GLOBAL  TYPES  X) 

Genretype  =  (Nonfiction,  Fiction); 

Frequencytype  =  (Frequently,  Seldomly,  When_Done); 

Hordtype  =  packed  array  (. 1 . .wordlength. )  of  char; 
Linetype  =  packed  array  (. 1 .. linelength . )  of  char; 

User_Inf otype  =  record 

Nane_of_text    linetype; 

Genre  :  Genretype; 

Size  :  Integer;  (X  users  guess  of  intext  length  x) 

Frequency  ;  Frequencytype;     (X  how  often  user  wants  reports  X) 

Hant_Concord  :  boolean; 

Want_Dix_Grow  :  boolean; 
end; 

(x  READER  MODULE  TYPES  x) 

Readtype  =  record      (X  created  by  Reader,  passed  to  Researcher  x) 

Word  :  wordtype; 

Length    integer; 
end; 
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(x  RESEARCHER  MODULE  TYPES  X) 

Balancetype  =  (Plus,  Zero,  Minus);   (x  Balance  of  Node  in  BST  x) 

Statustype  =  (Storable,  Not_Storable,  Not_Valuable) ; 

(x  Storable  -  a  full  node  with  all  values  which    x) 
(x  can  be  added  to  the  dictionary      X) 

(X  Not_Storable  -  a  node  of  a  variant  (-ing,  -ed)  X) 
(X  word  which  has  values  but  should     X) 

(X  not  be  added  to  dictionary  X) 

(X  Not_Valuable  -  a  node  of  a  new  word  without    X) 
(X  values.   Cannot  be  Add_Valued  and    X) 

(X  cannot  be  added  to  dictionary.       x) 

Sourcetype  =  (Latinate,  Germanic); 

Dif f icultytype  =  (PostGrad,  Grad,  High_School,  Elementary); 

Concretenesstype  =  (Tangible,  Intangible); 

Emotiontype  =  (Sublime,  Pleasant,  Neutral,  Unpleasant,  Horrid); 

Vigortype  =  (Violent,  Energetic,  Calm,  Inert); 

(x  tags  which  describe  each  word  in  the  dictionary         x) 

EntryPointer  =  -Entrytype; 
ConcordPointer  =  -Concordtype; 

Entrytype  =  record  (x  main  node  in  the  BST  x) 

Word  :  Wordtype;  (X  word  is  key  field    X) 

Balance  :  Balancetype; 

Status  :  Statustype; 

Source  :  Sourcetype; 

Difficulty  :  Dif f icultytype; 

Concreteness  :  Concretenesstype; 

Emotion  :  Emotiontype; 

Vigor  :  Vigortype; 

Left,  Right  :  EntryPointer; 

Down  :  Concordpointer; 
end; 

Concordtype  =  record  (X  a  Concord  node  is  a  cluster  of  pointers  x) 
Up  :  EntryPointer;  (x  which  create  linked  lists  from  the  firstx) 
Down,  (x  word  of  the  Intext  to  the  last,  and  fromX) 

Next,  (X  the  last  to  the  first.   Used  only  to    X) 

Last  :  ConcordPointer;  (x  create  the  concordance.  x) 

end; 
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LtrsPerWordtype  =  array  ( . 0 . . wordlength . )  of  integer; 
(x   array  of  total  number  of  words  of  each  length   x) 

WordsPerSenttype  =  array  ( . 0 . . maxsent . )  of  integer; 
(X  array  of  the  length  of  each  sentence  X) 

Profiletype  =  record  (*  the  profile  is  the  main  product  *) 

(x  of  the  Researcher  Module.   Here  x) 

(X  Researcher  keeps  track  of  all    X) 

(x  counts.  x) 

Totalwords    integer;         (X  Total  number  of  words  in  intext  X) 

Totalletters    integer; 

Totalsentences  :  integer; 

TotalStrucltrs  :  integer;    (X  Total  of  ltrs  of  structural  wrds  x) 

NumWordsThisSent    integer;  (x  f  words  in  sentence   being  read   X) 

Newwords    integer;  (X  f  words  user  adds  to  dixonary     X) 

KnownWords    integer;        (x  f  words  of  intext  found  in  dixon.x) 

VariantWords  :  integer;      (x  f  variant  words  of  intxt  in  dixonX) 

Structurewords    integer;    (x  #  stucture  words  of  intxt  in  4dixX) 

UnknownWords  :  integer;      (x  f  intext  words  not  found  anywhere*) 

Numfound  :  integer;  (X  f  total  words  minus  unknown  wordsX) 

LtrsPerWord  :  ltrsperwordtype; 

WordsPerSent  :  wordspersenttype; 

Ave_ltrs_per_word  :  real; 

Ave_wrds_per_sent    real; 

NumLatinate 

NumGermanic 


integer;  (X  rest  of  profile  are  counts  of  x) 

integer;  (X  words  of  the  intext,  found  in  X) 

integer;  (X  the  dictionaries,  which  have   X) 

NumGrad  :  integer;  (X  these  corresponding  values     X) 

NumHigh_School  :  integer; 


NumPostGrad 


NumElementary  :  integer; 
NumTangible  ••  integer; 
Numlntangible    integer; 
NumSublime  =  integer; 
NumPleasant  :  integer; 
NutnNeutral  :  integer; 
Numllnpleasant    integer; 
NumHorrid  ••  integer; 
NumViolent  :  integer; 
NumEnergetic    integer; 
NumCalm  :  integer; 
Numlnert  :  integer; 
end; 

Phrasetype  =  packed  array  ( . 1 . . phraselength . )  of  wordtype; 

(x  holds  the  phrases  bracketing  the  key  words  of  concordance  line  X) 
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(x  REPORTER  MODULE  TYPES  x) 

Length-type  =  (TooShort,  Short,  Medium,  Long,  TooLong); 

Etymologytype  =  (TooBorrowed,  Borrowed,  Mixed,  Native,  TooNative); 

Modulationtype  =  (Bad,  Average,  Good); 

Runonstype  =  (Unacceptable,  Acceptable,  Nonexistant ) ; 

Tangibilitytype  =  (Soft,  firm,  Solid); 

Strengthtype  =  (Weak,  Lively,  Strong,  VeryStrong); 

HardWordstype  =  (Easy,  Challenging,  Hard,  VeryHard); 

Emotionalitytype  =   (Poor,  Standard,  Rich); 

Tonetype  =  (Negative,  Bland,  Positive); 

(x  Judgements  of  intext  style  Reporter  makes  based  on  profile  X) 

Analysistype  =  record 

WordLength  :  Lengthtype; 

SentLength  :  Lengthtype; 

Modulation  :  Modulationtype; 

Runons    Runonstype; 

Etymology  :  Etymologytype; 

Tangibility  :  Tangibilitytype; 

Strength  :  Strengthtype; 

Hardwords    HardWordstype; 

Emotionality    Emotionalitytype; 

Tone    tonetype; 
end; 
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(x  GLOBAL  VARIABLES  X) 

var 

Info  :  User_Ihf otype; 

Allswell  ••  boolean;   (X  global  flag  that  execution  can  continue  x) 

(x  used  instead  of  a  GOTO  end  of  program   x) 

FourRoot,  (X  Root  of  the  FourDix  BST  X) 

Root  :  EntryPointer;         (X  Root  of  the  Dixonary  BST  x) 
Lastward  :  ConcordPointer;   (X  Pointer  to  last  word  read  X) 

Intext,  (X  file  that  holds  users  prose  X) 

Dixonary,  (X  dictionary  of  words  and  values  X) 

FourDix,  (x  diction,  of  short,  functional  words  X) 

Concrdnc,  (X  concordance,  created  by  researcher  X) 

NewNords,  (x  list  of  words  not  found  or  added  x) 

Report    text;  (X  main  product  of  Reporter  x) 


Tally,  (X  Researcher  scratch  pad  of  profile  X) 

Profile  :  profiletype;   (x  Researcher  input  to  Reporter     X) 

Analysis  :  analysistype;  (X  Judgements  made  by  reporter  X) 

(X  MAIN  PROGRAM  x) 

begin  (x  main  program  X) 

Interrogate_User; 

if  Allswell  then  Init_Reader; 

if  Allswell  then  Init_Researcher; 

if  Allswell  then  Ini t_Reporter; 

if  Allswell  then  Read_Intext; 

If  Allswell  then  Calculate_Prof ile; 

If  Allswell  then  Analyze_Prof i le_and_Report; 

If  Allswell  then  if  Inf o . Want_Concord  then 
W  r  i  t  e_C  o  n  c  o  r  da  n  c  e ; 

If  Allswell  then  if  Inf o . Want_Dix_Grow  then 
Store_New_Dix; 

If  not  Allswell  then 

Writeln  ('Execution  terminated  until  you  clear  that  up.1); 
end  . 
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riteln  (c);  Writeln; 
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Repeat 

Page;  Writeln;  Writeln;  Writeln; 

Writeln  ('  CONCORDANCE   f:<+5);  Writeln; 

Writeln  (•    1)  Yes      '  :45); 

Writeln  ( '    2)  No       ' :45); 

Read  (c);  Readln;  Writeln  (c:45); 

if  (c  =  '1')  then  Inf o . Want_Concord  :=  true 

else  if  (c  =  '2')  then  Inf o . Want_Concord  :=  false; 
Until  (c=  '1«)  or  (c  =  f2'); 
Repeat 

Page;  Writeln;  Writeln;  Writeln; 

Writeln  ('Approximately  how  many  words  are  in  '); 

Writeln  ( Inf o . Name_of_text,  '  ?':3); 

Info .size  : =  0 ; 

Good_integer  :=  true; 

while  not  EOLN  do  begin 
Read  (c); 
if  (c  in  ( .  »0' .  .  '9'  .  ))  then 

Info. size  :=  Info. size  x  10  +  ord(c)  -  ord('O') 
else  begin  good_integer  :=  false; 

Writeln  (c:2,  '  is  an  invalid  character.1);  end; 

end;  Readln; 
Until  Good_Integer; 
Writeln  ( Inf o . size : 45) ; 
Repeat 

Page;  Writeln;  Writeln;  Writeln; 

Writeln  (•    EXPAND  DICTIONARY  ?  »:45);  Writeln;  Writeln; 

Writeln  ('    1)  Yes      ' :45); 

Writeln  ( '    2)  No       » =  45); 

Read  (c);  Readln;  Writeln  (c:45); 

if  (c  =  '1')  then  Inf o . Want_Dix_Grow  :=  true 

else  if  (c  =  '2')  then  Inf o . Want_Dix_Grow: =false; 
Until  (c  =  '1')  or  (c  =  '2'); 
end;  (*  if  allswell  ^) 
end; 
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procedure  Init_Reader; 
begin 

Reset  (Intext,  'Intext  text  a'); 
end; 


procedure  Init_Researcher; 
var 
i  , 

stepcount, 
threshhold, 
dixlength  :  integer; 
entry    entrytype; 
balanced    boolean; 
begin 

Reset   (Dixonary,  'Dixonary  text  a'); 
Rewri te( Newwords,  'Newwords  text  a'); 

if  Inf o . Want_Ccncord  then  Rewrite  (Concrdnc,  'Concrdnc  text  a'); 
if  ( Info . frequency  <>  When_Done)  then  begin 
Dixlength  :=  0; 
while  not  EOF  (Dixonary)  do  begin 

Readln  (Dixonary);  Dixlength  :=  dixlength  +  1; 
end; 

Reset  (Dixonary/  'Dixonary  text  a*); 
end; 
case  Info . Frequency  of 


f reqnum; 

selnum; 

WhenDonenum; 


nil; 


Frequently  :  threshhold 
Seldomly    :  threshhold 
When_Done   :  threshhold 
end;  (*  cases  *) 
New  (Root); 
Load  (2,  Root-); 
Root-. left  :=  nil;  Root-. right 
Root- . down  : =  nil ; 
Root- . balance  :=  Zero; 
Root-. status  :=  Storable; 
Lastward  :=  nil; 
Stepcount  :  =  0; 

while  not  EOF  (Dixonary)  do  begin 
Load  ( 2,  Entry) ; 

Entry . balance  :=  Zero;  balanced  :=  false; 
AVL_Insert  (Entry,  Root,  balanced); 
if  ( Info . frequency  <>  When_Done)  then  begin 
stepcount  ■■  =    stepcount  +  1; 

if  (stepcount  mod  threshhold  =  0)  then  begin 
Page;  Writeln;  Writeln;  Writeln;  Writeln; 
Writeln  ('Stylist  is  now  loading  its  dictionary, 
Writeln; 
(  '        Last  word  loaded  was 
( entry .word: 40) ;  Writeln;  Writeln; 

('Stylist  has  loaded  ' : 30, stepcount : 5,  *  entries.'); 
('out  of  a  dictionary  of  * : 33, Dixlength : 5, *  words.'); 

Writeln; 
(Stepcount*100  div  Dixlength : 33,  •  X      complete.'); 
X) 


Writeln; 
Writeln 
Writeln 
Writeln 
Writeln 
Writeln; 
Writeln 
end;  (*  if 
end;  (X  if  X) 
end;  (X  while  not 


' ;50); 
' :50); 


EOF  X) 
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with  tally  do  begin 

totalletters  :=  0; 

totalwords  :  =  0; 

totalsentences  :=  0; 

totalstrucltrs  :=  0; 

numwordsthissent  :=  0; 

for  i  -.  =  0  to  wordlength  do  ltrsperword(  .  i  .  )  :=  0; 

for  i  : =  0  to  maxsent  do  wordspersent( . i  .  )  :=  0; 

KnownWords  : a  0; 

NewWords  :=  0; 

Numf ound  : =  0 ; 

VariantWords  :=  0; 

Structurewords  •■  -    0; 

UnknownWords  :=  0; 

NumLatinate  s -    0 ; 

NumGermanic  : =0 ; 

NumPostGrad  :=0; 

NunGrad  : =  0 ; 

NumHi gh_School  :=  0; 

NumElementary  :=  0; 

NumTangible  :  =  0; 

Numlntangibl e  :=  0; 

NumSublime  •■  =  0; 

NumPleasant  : =  0 ; 

NumNeutral  i -    0 ; 

NumUnpleasant  :'    0; 

NumHorrid  :=  0 ; 

NumViolent  :  =  0 ; 

NumEnergetic  :=  0; 

NumCalm  : =  0 ; 

Numlnert  : s  0; 
end;  (*  with  tally  do  X) 


Reset  (FourDix,  'FourDix  text  a'); 

New  (FourRoot);  FourRoot- . right  :=  nil;  FourRoot- . lef t  :=  nil; 

Load  (1/  FourRoot"); 

Attach  (FourRoot^,  FourRoot); 

FourRoot^ . balance  :=  Zero; 

While  not  EOF  (FourDix)  do  begin 

Load  ( 1 ,  Entry) ; 

Entry . balance  :=  Zero; 

balanced  :=  false; 

AVL_Insert  (Entry,  FourRoot,  balanced); 
end; 


end; 


62 


procedure  Load  (filenum  : 

var 

sourceltr,  difficultyl 
concretenessltr,  emoti 
vigorltr  :  char; 

begin 

if  (filenum  =  1)  then  R 


else  Readln  (Dixonary, 


with  entry  do  begin 

if  (sourceltr  =  •!') 


integer;  var  entry  :  entrytype); 

tr, 
onltr, 


eadln  (FourDix,  Entry. word, 

sourceltr,  dif f icultyltr, 

concretenessltr,  emotionltr, 

vigorltr) 
Entry .word, sourceltr,  dif f icultyltr, 

concretenessltr,  emotionltr, 

vigorltr) ; 


case  dif f icultyltr  of 

difficulty 

difficulty 

difficulty 

difficulty 

end; 

if  (concretenessltr3 


•h' 
1^1 


then  source  ;=  Latinate 
else  source  :=  Germanic; 

=  Postgrad; 

=  Grad; 

=  High_School; 

=  Elementary; 

'  t*)  then  concreteness 
else  concreteness 


case  emotionltr  of 
emotion 
emotion 
emotion 
emotion 
emotion 

end; 

case  vigorltr  of 


=  Tangible 
=  Intangible; 


's* 
•p« 
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•h« 
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Pleasant; 
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Unpleasant; 
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procedure  AVl_Insert  (Entry  :  entrytype;  var  p  :  Entrypointer; 

var  balanced  :  boolean); 
var 

pl,  p2  :  entrypointer; 
begin 

if  (p  =  nil)  then  begin 
Attach  (Entry,  p); 
P-. balance  :  =  Zero; 
balanced  :=  true; 
end 
else  if  (Entry. word  <  p-.word)  then  begin 

AVL_Insert  (Entry,  p-.left,  balanced); 

If  Balanced  then  (X  left  pointer  has  grown  higher  X) 

case  p-. balance  of 

Plus  :  begin  p-. balance  i-    Zero;  balanced  :=  false;  end; 
Zero    p-. balance  :=  Minus; 
Minus    begin  (x  rebalance  X) 
Pi  :=  p-.left; 

if  (pi-. balance  =  Minus)  then  begin(x  single  LL  rotatx) 
p-.left  :=  pi-. right; 
pl- . right  : =  p; 
p- . balance  :=  Zero; 
p  =  =  pl; 
end  (X  if  X) 

else  begin  (x  double  LR  rotation  X) 
p2  : =  pl- . right; 
pl-. right  ■■  =    p2-.left; 
p2-.left  :=  pl; 
p-.left  :=  p2- . right; 
p2- . right  s  =  p; 

if  (p2-. balance  =  Minus)  then  p- . balance  :s  Plus 
else  p- . balance  :=  Zero; 

if  (p2-. balance  =  Plus)  then  pl-. balance  :=  Minus 
else  pl-. balance  : a  Zero; 
p  ==  p2; 
end;  (X  else  X) 

P-. balance  :s  Zero;  balanced  :=  false; 
end;  (x  case  of  Minus  x) 
end;  (x  of  cases  x) 
end  (X  if  Entry. word  <   p-.word  X) 
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else  if  (  Entry. word  >  p-.word)  then  begin 

AVL_Insert  (Entry,  p- . right,  balan 
if  balanced  then  (X  right  pointer 
case  p-. balance  of 

Minus  :  begin  p-. balance  ■■ - 
balanced  : =  fals 
Zero  :  p-. balance  :=  Plus; 
Plus  :  begin  (x  rebalance  * 
pl  :=  p-. right; 
if  (pl-. balance  =  P 
(X  single  RR  X) 
p- . right  : =  pl- 
pl-.left  :=  p; 
p- . balance  : =  Z 

p  s=  pl; 

end  (X  if  X) 
else  begin  (X  doubl 
P2  :=  pl-.left; 
pl-.left  :=  p2- 
p2-. right  :=  pl 
p- . right  :=  p2- 
p2-.left  :=  p; 
if  (p2-. balance 

p- . balance 
else  p- . balance 
if  (p2-. balance 
pl-  .  balance 
else  pl-.balanc 
p  ==  p2; 
end;  (X  double  RL  r 
p- . balance  :=  Zero; 
end;  (x  case  of  balanc 
end;  (x  of  cases  x) 
end   (x  of  if  Entry. word  >  p-.word  X) 
else  balanced  :=  false; 
end;  (X  of  procedure  AVL  Insert  X) 


ced) ; 

has  grown  higher  x) 

Zero; 
e;  end; 

) 

lus)  then  begin 

.left; 

ero; 

e  RL  rotation  x) 
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!left; 

=  Plus)  then 
:=  Minus 
:=  Zero; 
=  Minus)  then 
:=  Plus 

e  : =  Zero; 

otation  X) 

balanced  :=  false; 
e  =  Plus  X) 


procedure  Init_Reporter; 
begin 

Rewrite  (Report,  'Report  text  a'); 

Writeln  (Report);  Writeln( Report ) ;  Writeln( Report ) ; 

Writeln  (Report,  Inf o . Name_of_Text : 45) ; 

Writeln  (Report,  ' PROFILE1 : 45) ; 

Writeln  (Report);  Writeln( Report) ; 
end; 
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or  (c  in  (  .  *  0  ■  .  .  '9 • . ) ) )  then  begin 
ernum  :=  letternum  +  1; 
letternum  <=  wordlength)  then  n .word( . letternum . )  :=  c; 
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d_of_Sentence; 
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stepcount  +  1; 
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teln;  Writeln;  Writeln;  Writeln; 
tylist  is  now  reading   :  *); 
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Writeln; 
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proce 
var  c 

f 
e 

begin 
tal 
tal 
tal 
f  ou 
if 
if 
tal 
if 

end; 


dure  Store_Intext  (n 
p  :  concordpointer; 
ound  :  boolean; 
ntry  :  entrytype; 


readtype) ; 


ly . totalwords  :  =  tally . totalwords  +  1; 

ly . totalletters  :=  tally . totalletters  +  n.  length; 

ly . numwordsthissent  :=  tally . numwordsthissent  +  1; 

nd  : =  false; 

(n. length  <  5)  then  Look_Up_Four  (n,  FourRoot,  found); 

not  found  then 

ly . LtrsPerWord( . n . length . )  :=  tally . LtrsPerWord( . n . length . )  +  1; 

not  found  then  Look_Up  (n,  Root); 


procedure  Signal_End_of_Sentence; 
begin 

tally .  totalsentences  ••  =  tally .  totalsentences  +  1; 
if  ( tally . totalsentences  >  maxsent)  then  begin 

Writeln  CINTEXT  TOO  BIG.');  Allswell  ==  false; 
end 

else  tal ly. WordsPerSent( . tally . totalsentences . )  := 
tal ly . numwordsthissent; 
tally . numwordsthissent  :=  0; 
end; 


procedure  Signal_End_of_File; 
begin 

if  Inf o . Want_Concord  then  Lastward- . next  :=  nil; 

tally . numf ound  :=  tally . totalwords  -  tally . unknownwords; 

Page;  Writeln;  Writeln;  Writeln;  Writeln; 

Write  ('Stylist  has  finished  reading     '); 

Writeln  ( Inf o . Name_of_text) ; 
end; 


procedure  Look_Up_Four  (n  :  readtype;  var  ancestor  :  entrypointer; 

var  found  :  boolean); 
concordpointer; 
ry  ••  entrytype; 


var  cp 
ent 

begin 
if  (a 
else 
f  ou 
if 
tal 
tal 
if 

end 
else 

else 
end; 


found  :=  false 
n.word)  then  begin 

inert)  then  Add_Values  (ancestor); 


ncestor  =  nil)  then 

if  (ancestor- .word  : 

nd  : =  true; 

(ancestoi — -.vigor  <> 

ly . totalstrucltrs  :=  tally . totalstrucltrs 

ly .structurewords  :=  tally .structurewords 

Inf o . Want_Concord  then 

Attach_Concord_Node  (ancestor,  cp); 
(x  if  x) 

if  (ancestor- .word  >  n.word)  then 
Look_Up_Four  (n,  ancestoi — '.left*  found) 
Look_Up_Four  (n,  ancestor- . right,  found); 


n . length; 
1; 
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proced 
var  cp 
en 
P 

fo 
c 
begin 
if  ( 


ure  Look_Up  (n  :  readtype;  var  ancestor 

concordpointer; 
try  ••  entrytype; 
;  entrypointer; 
und  :  boolean; 
:  char; 


entrypointer) ; 


ancestor  =  n 

Look_Up_Vari 

if  found  the 

tally . va 

New  (anc 

with  anc 

word 

left 

down 

end;  (* 

if  (p-.s 

Add 

with 

s 

s 

d 

c 

e 

v 

end; 

end  else 

a 


end 
else 


end 
else 

else 
end; 


end 

else  if  Info 
Repeat 
Page; 
Writel 
Writel 
Writel 
Writel 
Read  ( 
Until  (c 
if  (c  =  ' 
Entry. 
Get_Va 
Attach 
Tally. 
Add_Va 
end  else 
nd 

else   Atta 

f   Info. Want 

Attach_C 

(*  in  ancest 

if  (ancesto 

if  (ancesto 

tally. 

else  tally. 

if  (ancesto 

Add_Val 

if  Info. Wan 

Attach 
(*  if  x) 
if  (ancesto 
Look_Up  (n, 
Look_Up  (n, 


il)  then  begin 

ants  (n,  p,  found); 

n  begin 

riantwords  »=  tally . variantwords  +  1; 

estor ) ; 

estor-  do  begin 

=  n . word; 

=  nil;  right  : =  nil ; 

=  nil; 
with  ancestor  do  X) 
tatus  <>  not_valuable)  then  begin 
Values  (p); 

ancestoi — ■  do  begin 
tatus  :=  not_storable; 
ource  ;=  p-. source; 
ifficulty  :=  p- . difficulty; 
oncreteness  :  =  p-.concreteness; 
motion  :=  p- . emotion; 
igor  : =  p- .vigor; 
(x  with  ancestor  do  X) 

(x  p- . status  is  other  than  not_valuable  X) 
ncestoi — -.status  t -    not_valuable; 

. Want_Dix_Grow  then  begin 

Writeln;  Writeln;  Writeln; 
n  (n.word:  50);  Writeln;  Writeln; 
('ADD  IT  ?':45);  Writeln;  Writeln; 


) 


? 

CI)   Yes':45); 

(•2)   No  «:45); 

;  Readln; 

«1')  or  (c  =  '2* 

)  then  begin 
word  : =  n . word; 
lues_f or_New_Entry  (Entry); 

(Entry,  ancestor); 
newwords  i=  tal ly . newwords  +  1; 
lues  (ancestor); 
Attach_No_Value  (ancestor,  n); 

ch_No_Value  (ancestor,  n); 

_Concord  then 

oncord_Node  (ancestor,  cp); 

or  =  nil  x) 

r^.word  =  n.word)  then  begin 

i — '.status  <>  not_storable)  then 

knownwords  ••  -    tally .  knownwords  +  1 

unknownwords  :=  tally . unknownwords 

r-. status  <>  Not_Valuable)  then 

ues  (ancestor); 

t_Concord  then 

Concord_Node  (ancestor,  cp); 

i — '.word  >  n.word)  then 
ancestor- . left) 
anc estor-. right); 


+  1; 
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procedure  Attach  (Entry 
begin 

New  (p); 

with  p-  do  begin 

word  : =  entry. word; 

status  :=  storable; 

source  :=  entry .source; 

difficulty  s~    entry . difficulty; 

concreteness  :  =  entry . concreteness; 

emotion  :=  entry . emotion; 

vigor  :=  entry. vigor; 

left  :=  nil;  right  :=  nil; 

down  ; =  nil ; 
end; 
end; 


entrytype;  var  p  :  entrypointer) ; 


procedure  Attach_Concord_Node  (p  : 

var  cp 
var  Bottom, 

Bottommost  :  concordpointer; 
begin 

Bottom  :  =  p-.down; 
while  (Bottom  <>  nil)  do  begin 
Bottommost  :=  bottom; 
Bottom  :=  Bottom- . down; 
end; 

New  (cp); 
With  cp-  do  begin 
Up  :=  p; 

if  (Lastward  =  nil)  then  Last 
else  last  : =  Lastward; 
Down  : =  nil ; 
end; 
if  (P-.down  =  nil)  then  p- .  down  : 

else  Bottommost- . down  :=  cp; 
if  (Lastward  <>  nil)  then 
cp- . last- . next  :=  cp; 
lastward  : =  cp; 
end; 


entrypointer; 

:  concordpointer); 


=  nil 


=  cp 


procedure  Attach_No_Value  (var  ancestor  :  entrypointer; 

n  :  readtype); 
begin 

Writeln  (newwords,  n.word); 

New  (ancestor);  Ancestor- .word  :=  n.word; 

tally . unknownwords  :  =  tally . unknownwords  +  1; 

Ancestor- . right  :=  nil;  Ancestor- . left  :=  nil; 

Ancestor — -.down  :=  nil; 

Ancestoi — -.status    :=    not_valuable; 
end; 
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procedure  Look_Up_Variants  (original 

var  found 
iant  ;  readtype; 
integer; 


readtype;  var  p 
boolean) ; 


entrypointer; 


var 


var 

i   : 

begin 

found 

if  (o 

if  (o 

va 

va 

va 

Lo 


=  false; 
riginal . length  >  3)  then 

riginal .word( . original . length. )  =  's')  then  begin 
riant. word  :=  original .word; 
riant .word( . original . length . )  :-  '  *; 
riant. length  :=  original . length  -  1; 
ok_Up_Variant  (variant/  Root,  found,  p); 


if  not  found  and  (variant .word( .variant . length . )  =  'e')  then 
begin 

vai 

vai 

Lo< 

if 

be< 


iriant .word( .variant . length . )  t-    '  '; 
iriant . length  :=  variant . length  -  1; 
>ok_Up_Variant  (variant,  Root,  found,  p); 

not  found  and  (variant .word( . variant . length. ) 
»gin 


fi')  then 


en 
end; 


Look_Up_Variant  (variant,  Root,  found,  p); 
d; 


end 
els< 


if  (origin 

if  (origin 

( origina 

( origina 

variant . 

for  i  :  = 

varia 

variant . 

Look_Up_ 

if  not  f 

vari 

vari 

Look 

if  n 

( 


end; 
end; 


al . len 
al . wor 
1 .word 
1 .word 
word 

origi 
nt . wor 
length 
Varian 
ound  t 
ant . le 
ant .wo 
_Up_Va 
ot  f ou 
varian 
varian 
vari 
ariant 
ariant 
ook  Up 


gth   > 

d( . ori 
( . orig 
( . orig 
=  orig 
nal  .  le 
d(  .i.) 

:  =  or 
t  (var 
hen  be 
ngth 
rd( . va 
riant 
nd  and 
t . word 
t .word 
ant . le 
. word( 
. word( 

Varia 


4)  then 

ginal . length. )  =  'g')  and 
inal. length  -  1.)  =  'n1)  and 
inal. length  -  2.)  =  'i')  then  begin 
inal .word; 

ngth  -  2  to  original . length  do 
•  =  »  •  • 

iginal . length  -  3; 

iant, Root,  found,  p); 

gin 

=  variant . length  +  1; 

riant . length. )  : =  'e'; 

(variant,  Root,  found,  p); 

(  .variant . length  -  2.)  = 

( .variant . length  -  1.))  then  begin 

ngth  ;=  variant . length  -  2; 

.  variant . length  +  1.)  :=  *  ' 

.  variant . length  +  2.)  : =  '  ' 

nt  (variant,  Root,  found,  p) 


end 
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else  if  (original . length 
if  (original .word( . o 
(original .word( .or 
variant. word  : =  or 
for  i  : =  original . 
variant . word( . i 
variant . length  := 
Look_Up_Variant  (v 
if  not  found  then 
if  (variant. won 
variant .word 
variant . leng 
Look_Up_Vari 
end; 
if  not  found  then 
variant . length 
variant . word( . 
Look_Up_Varian 
if  not  found  a 
(variant .wo 
variant .wo 
variant . 
variant . wor 
variant .wor 
Look_Up_Var 
end; 
end; 


>  4)  then 

riginal . length . )  =  'd*)  and 

iginal . length  -  1.)  =  *e')  then  begin 

iginal .word; 

length  -  1  to  original . length  do 

.)  ==  ■  '; 

original . length  -  2; 

ariant,Root,  found,  p); 


d( .variant. length. ) 
( .variant . length . )  :=  'y'; 
th  ■•  =  variant .  length  -  1; 
ant  (variant, Root,  found,  p); 


• i  * )  then  begin 


begin 

=  variant . length  +  1; 
variant . length . )  :=  'e'; 
t  (variant,  Root,  found,  p); 
nd 

rd( .variant . length  -  2.)  = 
rd( .variant . length  -  1.))  then  begin 
length  :=  variant . length  -  2; 
d(  .variant . length  +  1.)  :=  '  * ; 
d(  .variant . length  +  2.)  :=  *  '; 
iant  (variant,  Root,  found,  p); 


end 

else 


endj 
end; 


if  (original . length 
if  ( original .word( . o 
(original .word( .or 
variant. word  : -    or 
for  i  : =  original . 
variant .word( .i 
variant . length  := 
Look_Up_Variant  (v 
if  not  found  then 
variant . length 
variant . word( 
variant . word( 
Look_Up_Varian 
if  not  found  a 
(variant .wo 
variant . 
variant .wor 
variant .wor 
variant .wor 
Look_Up_Var 
end; 
end; 


>  4)  then 

riginal . length . )  =  fy')  and 

iginal . length  -  1.)  =  *  1')  then  begin 

iginal . word; 

length  -  1  to  original . length  do 

.)  :=  '  •; 

original . length  -  2; 

ariant,Root,  found,  p); 

begin 

variant . length  +  2; 
variant . length . )  :=  'e'; 
variant . length  -  1.)  :=  '1'; 
t  (variant,  Root,  found,  p); 
nd 

rd( . variant . length  -  2.)  =   'i')  then  begin 
length  : =  variant . length  -  2; 
d( . variant . length . )  :  =  'y'; 
d( .variant . length  +  1.) 
d( .variant . length  +  2.) 
iant  (variant,  Root,  found,  p) 


t  i 
i  i 
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procedure  Look_Up_Variant  (variant  :  readtype; 

var  ancestor  :  Entrypointer; 
var  Found  :  Boolean; 
var  p  :  Entrypointer); 


begin 
if  (a 
else 


ncestor  =  nil)  then  found  :  =  false 

if  (ancestor^ .word  =  variant .word)  then  begin 

found  : =  true; 

p  : =  ancestor; 
end 
else  if  (ancestoi — '.word  >  variant .word)  then 

Look_Up_Variant  (variant,  ancestoi — '.left,  found,  p) 
Look_Up_Variant  (variant,  ancestoi — -.right,  found,  p) 


else 
end; 


proced 
begin 
if  ( 

T 

else 

case 

P 

G 

H 

E 

end; 

if  ( 

else 

case 
S 
P 
N 
U 
H 

end; 

case 
V 
E 
C 
I 

end; 
end; 


ure  Add_Values  (p  :  entrypointer); 


p^ .source 
ally . NumLa 
Tally. Num 
P- . dif f ic 
ostGrad 
rad 

igh_School 
lementary 

P^ . concret 
Tal ly . Num 
Tally . Num 
P^ . emotio 

ubl ime 

leasant 

eutral 

npleasant 

orrid 

P~ . vigor 
iolent 
nergetic 
aim 
nert 


=  Lat 

tinat 
Germa 
ulty 
Ta 

Ta 
Ta 
Ta 

eness 
Intan 
Tangi 
n  of 
Tal 
Tal 
Tal 
Tal 
Tal 


inate)  then 

e  :=  Tal ly . NumLatinate  +  1 

nic  :=  Tally . NumGermanic  +  1; 

of 

Hy.NumPostGrad  ==  Tally .  NumPostGrad  +  1; 

lly.NumGrad  :=  Tally . NumGrad  +  1; 

lly . NumHigh_School  :=  Tally . NumHigh_School  +  1; 

lly . NumElementary  :=  Tally . NumElementary  +  1; 

=  Intangible)  then 
gible  :=  Tal ly . Numlntangible  +  1 
ble    :=  Tally . NumTangible  +  1; 


ly . NumSublime 
ly . NumPleasant 
ly . NumNeutral 


=  Tally . NumSublime  +  1; 

:=  Tally. NumPleasant  +  1; 

=  Tally. NumNeutral  +  1; 
ly . NumUnpleasant  :=  Tal ly . NumUnpleasant  +  1; 
ly . NumHorrid  :=  Tal ly . NumHorrid  +  1; 


of 


Tal 
Tal 
Tal 
Tal 


ly . Numviolent  :=  tal ly . numviolent  +  1; 
ly . NumEnergetic  :=  Tally . NumEnergetic  +  1; 
ly.NumCalm  s=  Tally . NumCalm  +  1; 
ly.Numlnert  i~    Tally . Numlnert  +  1; 
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procedure  Get 
var  c  :  char; 
good_char 
begin 

Repeat 
Page; 
Write 
Write 
Write 
Write 
Write 
Write 
Read 
Until  (c 
if  (c  = 

Repeat 

Page; 

Write 

Write 

Write 

Write 

Write 

Write 

Write 

Write 

Read 

Until  (c 

case  c  o 

'1« 
t2t 

'3' 
'4' 

end; 

Repeat 
Page; 
Wri  te 
Write 
Write 
Write 
Read( 

Until  (c 

if  (c  = 

Repeat 

Page; 

Write 

Write 

Write 

Write 

Write 

Write 

Write 

ReadC 

Until  (c 

case  c  o 

•l1 
i2i 

'3' 
141 

i5i 

end; 


_Values_f or_New_Entry  (var  Entry  :  entrytype); 
:  boolean; 


In; 

In 

In 

In; 

In 

In 

(c) 


In; 

In 

In; 

In 

In 

In 

In 

In 

(c) 


Writeln;  Writeln; 
(■SOURCE  ' :45);  Writeln;  Writeln; 
( Entry .word : 45) ; 

Writeln; 
CI)  LATINATE  •  :45); 
( '2)  GERMANIC  ' :45); 
;  Writeln  (c=45);  Readln; 
'1')  or  (c  =  '2'); 
)  then  Entry. source  :=  Latinate 
else  Entry. source  :=  Germanic; 


Writeln;  Writeln; 
( 'DIFFICULTY  LEVEL  ' :45); 

Writeln; 
( Entry .word : 45) ;  Writeln; 
CI)  POSTGRADUATE' :45) 


»2)  GRADUATE 
'3)  HIGH  SCHOOL 
'4)  ELEMENTARY 
Writeln  ( c : 45) 
»1')  or  (c  =  '2* 


:45) 
:45) 
:45) 

Readln; 
or  (c  = 


Writeln; 


31)  or  (c  =  '4'); 


Entry . difficulty 
Entry. difficulty 
Entry. difficulty 
Entry. difficulty 


PostGrad; 
Grad; 

High_School ; 
Elementary; 


Writeln;  Writeln; 
In  ('CONCRETENESS 
In  ( Entry . word : 45) 
In  CI)  TANGIBLE  '  =45); 
In  ( f2)  NOT  TANGIBLE  '  :45); 
c);  Readln;  Writeln;  Writeln 

=  »1')  or  (c  =  '2'); 
'1')  then  Entry . concreteness 
else  Entry . concreteness 

Writeln;  Writeln;  Writeln; 


Writeln; 

:45);  Writeln;  Writeln; 
Writeln;  Writeln; 


In 
In 
In 
In 
In 
In 
In 

C); 


('EMOTIONAL  CONNOTATION 


( Entry. word : 45) 
CI)  SUBLIME 

PLEASANT 

NUETRAL 

UNPLEASANT 

HORRID 
Readln;  Writeln; 


'2) 
•3) 
•4) 
'5) 


1') 


or 


(c  =  '2') 


Writeln; 

45); 

45); 

45); 

45); 

45); 
Writeln 
or  (c  = 


(c:45); 

: =  Intangible 
:=  Tangible; 


45);  Writeln; 
Writeln; 


Writeln; 


(c:45); 
•3')  or 


(c  =  '4')or(c  =  '5') 


Entry. emotion 
Entry. emotion 
Entry. emotion 
Entry. emotion 
Entry. emotion 


Sublime; 
Pleasant; 
Neutral ; 
Unpleasant; 
Horrid; 


73 


Repeat 

Page;  Writeln;  Writeln;  Writeln; 
Writeln  ( 'VIGOR1 -.45) ;  Writeln;  Writeln; 


Writeln  (  Entry  .word  •■  45) ; 

Writeln; 

Writeln; 

Writeln  CD  VIOLENT    ' 

45); 

Writeln  ( '2)  ENERGETIC  ' 

45); 

Writeln  C3)  CALM       ' 

45); 

Writeln  ('4)  NONE      * 

45); 

Read  (c);  Writeln  (c:45);  Readln; 

Until  (c  =  '1')  or  (c  =  '2')  or  (c  = 

•3')  or  ( 

case  c  of 

•1' 

Entry .vigor 

=  Violent; 

i2i 

Entry .vigor 

=  Energetic; 

'3» 

Entry. vigor 

=  Calm; 

•4' 

Entry . vigor 

=  Inert; 

end; 

(c 


4'); 


end; 


proced 
begin 
Page 
Writ 
Writ 
Writ 
Writ 
Writ 
Writ 
Writ 
Writ 
Writ 
Inor 
end; 


ure  Wri te_Concordance; 

;  Writeln;  Writeln;  Writeln;  Writeln; 

e  ('Stylist  is  now  writing  the  concordance  for 

eln  ( Inf o . Name_of_text ) ; 

eln;  Writeln; 

eln  ('Concordance  will  be  written  to  "Concrdnc 

eln;  Writeln; 

eln  (Concrdnc);  Writeln  (Concrdnc); 

eln  (Concrdnc,  'CONCORDANCE' : 55) ; 

eln  (Concrdnc,  Inf o . Name_of_text : 55 ) ; 

eln  (Concrdnc);  Writeln  (Concrdnc); 

der  Concordance  (Root); 


text  a"' :50) 
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procedure  Inorder_Concordance  (var  Root  :  entrypointer) ; 

(x  This  is  the  most  complex  procedure  in  the  program.  It  traverses  X) 
(x  the  Dixonary  Binary  Search  Tree  (BST)  in  inorder  fashion,  this  x) 
(X  visiting  each  important  word  used  in  the  Intext  in  alphabetic  X) 
(X  order.  During  the  visit  to  each  word,  it  goes  down  the  concord  x) 
(X  list  one  at  a  time,  thus  examining  each  use  of  that  word  in  the  x) 
(X  order  it  was  used  in  the  Intext.  At  each  visit  down  the  concord  X) 
(X  list,  it  follows  the  Lastward  linked  list  to  find  the  phrase  in  X) 
(X  the  Intext  that  preceded  that  instance  of  the  use  of  the  word,  x) 
(X  prints  the  phrase, capitalizes  and  prints  the  word,  then  follows  X) 
(X  the  Nextward  linked  list  to  find  and  print  the  phrase  following,  x) 
var  Lastward,  (X  pointer  to  word  previous  in  intext  X) 

Nextward,  (X  pointer  to  word  following  in  intextX) 

Downward  :  concordpointer;  (X  pointer  to  next  use  of  same  word    x) 
Phrasel,  (X  phrase  preceding  key  word  in  concordx) 

Phrase2  :  Phrasetype;       (X  phrase  following  key  word  in  concordx) 
w,  (X  #  of  word  being  printed  x) 

1,  (X  f  of  letter  in  word  being  printed  X) 

offset, 
i     integer; 
c    char; 
begin 

offset  :=  ord  ('A')  -  ord  ('a'); 
if  (Root  <>  nil)  then  begin 

Inorder_Concordance  (Root-. left); 
Downward  :=  Root-. down; 
While  (Downward  <>  nil)  do  begin 
w  :  =  0; 

Lastward  ■•  =  Downward- .  last ; 

while  (Lastward  <>  nil)  and  (w  <  Phraselength)  do  begin 
w  :  =  w  +  1 ; 

PhraseK.w.)  :=  Lastward- .  up- .word; 
Lastward  :=  Lastward- . last; 
end; 

for  i  •.  =  w  downto  1  do  begin 
1  ••  =  1; 

while  (1  <=  wordlength)  do  begin 
if  (PhraseK .i.)( .1. )  <>  •  •)  then 

Write  (Concrdnc,  Phrasel (. i .)(. 1 .)) ; 
1  ••=  1  +  1; 
end; 

Write  (Concrdnc,  *  *:1); 
end; 

Write  (Concrdnc,  '    •); 
1  ==  1; 

while  (1  <=  wordlength)  do  begin 
c  :=  Root- . word( . 1 . ) ; 
if  (c  in  (  .  'a'  .  .  'z' .))  then 

c  :=  chr(ord(c)  +  offset); 
if  (Root-.word( .1.)  <>  •  ')  then 

Write  (Concrdnc,  c); 
1  :=  1  +  1; 
end; 

Write  (Concrdnc,  *    '); 
w  :  =  0; 

Nextward  ••-    Downward- .  next; 
while  (Nextward  <>  nil)  and 

(w  <  Phraselength)  do  begin 
w  :  =  w  +  1 ; 

Phrase2(.w.)  :=  Nextward- . up- .word; 
Nextward  :=  Nextward- . next; 
end; 
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for  i  : =  1  to  w  do  begin 
1  :=  1; 
while  (1  <=  wordlength)  do  begin 

if  (Phrase2(.i.)(.l.)  <>  '  •)  then 

Write  (Concrdnc,  Phrase2( . i . ) ( . 1 . ) ) ; 
1  :=  1  +  1; 
end; 

Write  (Concrdnc,  '  ':1); 
end; 

Writeln  (Concrdnc); 
Downward  :=  Downward- . down ; 
end; 

Inorder_Concordance  (Root-. right); 
end;  (x  if  Root  <>  nil  X) 
end;    (x  procedure  Inorder  Concordance  X) 
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procedure  Calculate_Prof ile; 

var  numsigltrs  :  integer;    (*  number  of  significant  letters  *) 

numsigwrds  :  integer;    (*  number  of  significant  words  *) 
(X  in  this  context,  structureal  words  are  insignificant  *) 
begin 

if  (tally . totalsentences  =  0)  then  tal ly . totalsentences  :=  1; 
if  (tally . totalwords  =  0)  then  tally . totalwords  :=  1; 
numsigwrds  :=  tally . totalwords  -  tally . structurewords; 
numsigltrs  :=  tally . totalletters  -  tally . totalstrucltrs; 
Profile  :=  tally; 


I  I  \J   I  J.  ±   c   •      V.C3  x  i  y  » 

prof ile . Ave_ltrs_per_word  : a  numsigltrs/numsigwrds; 

profile  .  Ave_wrds_per_sent  •■  -    tally,  to  tal  words/ tally  .totalsentences; 


ena; 


procedure  Analyze_Prof ile_and_Report;   (*  main  Reporter  procedure  x) 
begin 

Page;  Writeln;  Writeln;  Writeln;  Writeln; 

Write  ('Stylist  is  now  analyzing  the  style  of  '); 

Writeln  ( Inf o . Name_of_text) ; 

Writeln;  Writeln; 

Writeln  ('Report  of  analysis  will  be  written  to  "Report  text  a*":50); 

Writeln;  Writeln; 

Writeln  (Report);  Writeln  (Report); 

Writeln  (Report);  Writeln  (Report); 

Calculate_Size_of_Text  (profile,  analysis); 

Calculate_Length_of_Words  (profile,  analysis); 

Calculate_Length_of_Sentences  (profile,  analysis); 

Calculate_Etymology  (profile,  analysis); 

Calculate_Dif f iculty_of_Vocabulary  (profile,  analysis); 

Calculate_Tangibili ty  (profile,  analysis); 

Calculate_Emotional_Tone  (profile,  analysis); 

Calculate_Vigor_of_Words  (profile,  analysis); 

Make_Recommendations  (analysis); 
end; 
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procedure  Calculate_Size_of_Text  (profile  ••  profiletype; 

var  analysis  :  analysistype) ; 
var  i  , 

PerCentFound  :  integer; 
begin 

Writeln(Report);  Writeln( Report) ;  WritelnC  Report)  ; 

Write  (Report,  'Total  of  sentences  :  '); 

Writeln  (Report,  prof ile . Totalsentences) ; 

Write  (Report,  'Total  of  words      :  '); 

Writeln  (Report,  prof ile . Totalwords) ; 

Write  (Report,  'Total  of  letters    :  '); 

Writeln  (Report,  prof ile . totalletters) ; 

Writeln  (Report);  Writeln( Report ) ;  Writeln( Report ) ; 

if  ( prof ile . totalwords  <  500)  then  begin 

Write  (Report,  Prof ile . totalwords,  '  words  are  '); 

Writeln  (Report,  'too  few  for  valid  statistical  analysis.1); 
end 
else  if  ( prof i le . totalwords  <  1000)  then  begin 

Write  (Report,  Prof i 1 e . totalwords,  *  words  are1); 

Writeln  (Report,  *  enough  for  valid  statistical  analysis.1); 
end 
else  begin 

Write  (Report,  Prof ile . totalwords,  '  words  are  '); 

Writeln  (Report,  '   plenty  for  valid  statistical  analysis.'); 
end; 

Writeln  (Report);  Writeln( Report) ;  Writeln( Report) ; 
Write  (Report,  'Of  the  ',  prof ile . totalwords : 6 ,  '  words   in  '); 
Writeln  (Report,  Inf o . Name_of_Text,  ','); 
Write  (Report,  prof i le . numf ound : 6 ) ; 

Writeln  (Report,'  were  matched  to  words  in  the  Stylist  dictionary.1); 
percentfound  :=  prof i 1 e . numf ound  x  100  div  prof ile . totalwords; 
Writeln  (Report); 

Writeln  (Report,  Percentfound :  2,  '  '/.      were  matched.'); 
if  (percentfound  <  50)  then  begin 

Write  (Report,  Inf o . Name_of_text) ; 

Writeln  (Report,  '  must  contain  many  specialty  or  unique  words.'); 

Writeln  (Report,  'Statistical  analysis  is  not  valid.'); 
end 

else  Writeln  (Report,  'This  is  enough  for  statistical  analysis.'); 
if  ( prof i le . newwords  >  0)  then  begin 

Write  (Report,  'You  added  ',  prof ile . newwords) ; 

Writeln  (Report,'  to  the  dictionary  during  this  session.'); 
end; 
end;  (*  procedure  Calculate  Size  of  intext  x) 
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procedure  Calculate_Length_of_Words  (profi 

var  a 
begin 

Graph_LtrsPerWord  (profile. ltrsperword) ; 
Writeln  (Report);  Writeln  (Report); 
Write  (Report,  'The  average  number  of  le 
Writeln  (Report,  prof ile . Ave_l trs_per_wo 
Writeln (Report) ;  Writeln(Report); 
if  (Info. Genre  =  NonFiction)  then  begin 
Writeln  (Report,  *A  typical  nonfictio 
Write  (Report,  'resembles  a  low  bell- 
Writeln  (Report,  'centered  around  six 
if  ( prof i le . Ave_L trs_Per_Word  >  7)  th 

Analysis . WordLength  :=  Toolong 
else  if  ( prof ile . Ave_Ltrs_Per_Word  > 

Analysis . WordLength  :=  Long 
else  if  ( prof i le . Ave_Ltrs_Per_Word  > 

Analysis . WordLength  :=  medium 
else  if  ( prof ile . Ave_L trs_Per_Word  > 

Analysis . WordLength  :=  short 
else  Analysis . WordLength  :=  tooshort 
end 


le  :  profiletype; 
nalysis  ■    analysistype) ; 


tters  per  word      *); 
rd); 


n  texts  distribution  '); 
shaped  curve  ' ) ; 

letters/word. ' ) ; 
en 

6.5)  then 

5.5)  then 

5)  then 


else  begin  (*  genre 
Writeln  (Report, 
Write  (Report,  ' 
Writeln  (Report, 
if  (profile. Ave 


is  fiction  x) 
'A  typical  fiction  t 
resembles  a  tall  bel 
'centered  around  fiv 
Ltrs  Per  Word  >  6)  th 


Analysis . WordLength  :=  Toolong 
else  if  ( prof ile . Ave_Ltrs_Per_Word  > 

Analysis . WordLength  :=  Long 
else  if  ( prof ile . Ave_Ltrs_Per_Word  > 

Analysis . WordLength  :=  medium 
else  if  ( prof ile . Ave_Ltrs_Per_Word  > 

Analysis . WordLength  :=  short 
else  Analysis . WordLength  :=  tooshort; 
end; 

Writeln  (Report); 

Write  (Report,  'The  length  of  the  words 
case  Analysis . WordLength  of 

TooShort  :  Writeln  (Report,  'too  shor 
Short  :  Writeln  (Report,  'short.'); 
medium  :  Writeln  (report,  'medium'); 
Long  :  Writeln  (Report,  'long.'); 
Toolong  :  Writeln  (Report,  'too  long, 
end; 

Writeln  (Report);  Writeln( Report ) ;  Write 
end;  (*  calculate  length  of  words  *) 


exts  distribution  *); 
1-shaped  curve  '); 
e  letters/word.'); 
en 

5.5)  then 

5)  then 

4)  then 


is  '); 
t.'); 

'); 

ln( Report) ; 
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procedure  Graph_Ltrsperword  ( ltrsperword  :  ltrsperwordtype); 
var  perltrwords  :  ltrsperwordtype; 

i,  j  •.  integer; 
begin 

Page  (Report); 

WritelnC Report) ;  WritelnC Report ) ;  Writeln(Report) ; 

Writeln  (Report, ' BREAKDONN  OF  PERCENT  OF  LETTERS  PER  WORD':60); 

Writeln( Report) ;  Writeln( Report); 

for  i  : =  0  to  wordlength  do 

PerLtrWords( .i . )  :=  ( LtrsPerWord( . i . )*100 )  div 

( tal ly . total words  -tally . st rue tu rewords) ; 
for  i  :=  15  downto  1  do  begin 

Write  (Report,  i*2=2,  '    '=<♦); 
for  j  : =  1  to  wordlength  do 

if  (PerLtrWordsC . j .)  >=  i*2)  then 

Write  (Report, '   X   ' :5) 
else  if  (PerLtrWords( . j . )  >=  (i*2)-l)  then 

Write  (Report,  '   .   ■  :5) 
else 

Write  (Report, •      • :5); 

Writeln  (Report); 
end; 

Write  (Report,  ■     '  :  4)  ; 
for  i  : =  1  to  wordlength  do 

Write  ( Report , i  :  5) ; 
Writeln  (Report); 

Writeln  ( Report, ' Number  of  Letters  in  a  Word':45); 
end; 
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procedure  Calculate_Length_of_Sentences  (profile  :  profiletype; 

var  analysis  :  analysistype) ; 
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:  integer; 

profile.WordsPerSent); 

average  number  of  words 

f i le . Ave_wrds_per_sent ) ; 

teln( Report ) ;  Writeln( Report ) ; 

pical  modern  texts  sentences  average 

etween  fifteen  and  twenty  words.1); 

_per_sent  >  22)  then 

gth  : =  Toolong 

18) 


_wrds_per_sent  >  18)  then 
gth  : =  Long 

_wrds_per_sent  >  15)  then 
gth  : =  Medium 
_wrds_per_sent  >  10)  then 
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rof ile . totalsentences  do 

. wordspersent( . i . )  >  45)  then 

=  runons  +  1 

of ile . wordspersent( . i . )  >  8)  and 

of ile . wordspersent( . i . )  <  20)  then 

ts  :=  sweetspots  +  1; 

runonsXlOO  div  prof ile . totalsentences; 

>  5)  then  Analysis . Runons  :=  unacceptable 
unOns  >  0)  then  Analysis . Runons  :=  acceptable 
nons  :=  nonexistant; 


'Number  of  run  ons 
'Percent  of  run  ons 


runons  :  4) ; 
percentrunons:4); 


'Run  ons 
iunons  of 


are 


); 


Writeln  (Report,  'nonexistant.'); 
Writeln  (Report,  'acceptable.'); 
:  Writeln  (report,  'unacceptable'); 


ts  :=  sweetspotsXlOO  div  prof ile . totalsentences; 
tspots  >  50)  then  Analysis . Modulation  i -    Good 
tSweetspots  >  20)  then  Analysis . Modulation  :=  Average 
odulation  :=  bad; 
); 


); 


'Number  of  medium  length  sentences 
•Percent  of  medium  length  sentences 
pe re en tswee tspots :  4) ; 


*,  sweetspots : 4) ; 
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81 


Write  (Report,  'Modulation  is  '  ) ; 
case  Analysis .Modulation  of 

Good  :  Writeln  (Report,  •  good.'); 
Average  -.    Writeln  (Report,  'average'); 
Bad  :  Writeln  (report,  'bad'); 
end; 
end;  (*  procedure  analyze  length  of  sentences  *) 
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procedure  Graph_WordsPerSent  (Graph  :  WordsPerSenttype) ; 

var  i,  j ,  pagenum  :  integer; 

begin 

Pagenum  •■-    1; 

While  (Tally . totalsentences  >  (Pagenum  -  1)  X  70)  do  begin 
Page  (Report); 

Writeln  (Report, • BREAKDOWN  OF  NUMBER  OF  WORDS  PER  SENTENCE' i 6 0 ) ; 
Writeln  (Report, *  Number  of  words'); 
for  i  :=  1  X  ((Pagenum  -  1)  X  70)  to  70  *  (Pagenum)  do 

GraphC.i.)  :=  (GraphC.i.)  div  2); 
for  i  ;=  50  downto  1  do  begin 
Write  (Report, iX2:2,  '  ':!); 

for  j  :=  1  X  ((Pagenum  -1)  x  70)  to  70  X  (Pagenum)  do 
if  (Graph(.j.)  >=  i)  then 

Write  (Report,  "X'-.D  else  Write  (Report,  f  »:1); 
Writeln  (Report); 
end; 

Write  (Report,  '    '); 
for  i  :=  1  to  7  do 

Write  (Report,  i  x  10  +  ((Pagenum  -  1)  X  70):10); 
Writeln  (Report); 

Writeln  (Report,  'Sentence  Number  ':  45)  ; 
Writeln  (Report); 
Pagenum  :-    Pagenum  +  1; 
end; 
end; 

procedure  Calculate_Etymology  (profile  :  profiletype; 

var  analysis  :  analysistype) ; 
begin 

Writeln  (Report);  Writeln( Report) ;  Writeln(Report) ; 

Writeln  (Report,  'ETYMOLOGY  OF  WORDS'  :<+5); 

Writeln( Report);  Writeln( Report); 

Write  (Report,  'Number  of  Latinate  words    :  *); 

Writeln  (Report,  prof ile . NumLatinate) ; 

Write  (Report,  'Number  of  Germanic  words    :  '); 

Writeln  (Report,  prof ile . NumGermanic) ; 

Writeln  (Report); 

if  (Info. Genre  =  NonFiction)  then  begin 

if  (prof ile . NumLatinate  >  prof ile . NumGermanic  x  3)  then 

Analysis . Etymology  :  =  TooBorrowed 
else  if  ( prof ile . NumLatinate  >  prof ile . NumGermanic  X  2)  then 

Analysis . Etymology  :=  Borrowed 
else  if  ( prof ile . NumLatinate  >  prof ile . NumGermanic)  then 

Analysis . Etymology  :=  Mixed 
else  if  ( prof i le . NumLatinate  >  prof ile . NumGermanic  X  0.9)  then 

Analysis  .  Etymology  •.-    Native 
else  Analysis . Etymology  : s  TooNative; 
end 
else  (X  genre  =  Fiction  X)  begin 

if  ( prof ile . NumLatinate  >  prof i le . NumGermanic  X  2)  then 

Analysis . Etymology  :=  TooBorrowed 
else  if  (prof ile . NumLatinate  >  prof ile . NumGermanic  x  1.5)  then 

Analysis . Etymology  :=  Borrowed 
else  if  (prof ile . NumLatinate  >  prof ile . NumGermanic)  then 

Analysis . Etymology  :=  Mixed 
else  if  (prof ile . NumLatinate  >  prof ile . NumGermanic  X  0.5)  then 

Analysis. Etymology  : -    Native 
else  Analysis. Etymology  : =  TooNative; 
end; 

Write  (Report,  'Etymology  is  '); 
case  Analysis . Etymology  of 

TooBorrowed  :  Writeln  (Report,  'very  borrowed.'); 
Borrowed  :  Writeln  (Report,  'borrowed.'); 
Mixed  ••  Writeln  (report,  'mixed'); 
Native    Writeln  (Report,  'native.'); 
TooNative  :  Writeln  (Report,  'very  native.'); 
end; 
end;  (x  procedure  calculate  etymology  X) 
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procedure  Calculate_Dif f iculty_of_Vocabulary  (profile  :  profiletype; 

var  analysis  :  analysistype) ; 
var 

PerCentPostGrad, 
PerCentGrad, 
PerCentHigh  :  integer; 
begin 

Writeln  (Report);  Writeln(Report) ;  Writeln( Report) ; 
Writeln  (Report,  'DIFFICULTY  OF  VOCABULARY •: 45) ; 
Writeln( Report);  Writeln( Report); 

Write  (Report,  '   PostGraduate  difficulty     :  '); 
Writeln  (Report,  prof ile . NumPostGrad) ; 
Write  (Report,  '   Graduate  difficulty         :  '); 
Writeln  (Report,  prof ile . NumGrad) ; 

Write  (Report,  *   High  School  difficulty      :  '); 
Writeln  (Report,  prof i le . NumHigh_School ) ; 
Write  (Report,  *   Elementary  difficulty       :  *); 
Writeln  (Report,  prof ile . NumElementary) ; 
Writeln (Report);  Writeln( Report); 

percentPostgrad  :=  prof i le . numpostgrad*10Q  div  prof ile . numfound; 
percentGrad  ■ -    prof i le . numgrad*l 00  div  prof i le . numfound; 
percenthigh  t  =  prof ile . numhigh_school*100  div  prof  ile  .  numfound; 
if  (percentPostgrad  >  0)  then  analysis . HardWords  i ~    veryhard 
else  if  (percentgrad  >  5)  then  analysis . HardWords  :=  hard 
else  if  (percenthigh  >  10)  then  analysis . HardWords  :  =  challenging 
else  analysis . HardWords  :=  easy; 
Write  (Report,    'Percent  of  Postgraduate  difficulty  *); 
Writeln  (Report,  percentpostgrad) ; 

Writeln  (Report,  'Percent  of  Graduate  difficulty      ',  percentgrad); 
Writeln  (Report,  'Percent  of  High  School  difficulty   '  ,  percenthigh); 
Writeln( Report);  Writeln( Report); 
Write  (Report,  'Difficulty  is  '); 
case  Analysis . HardWords  of 

Veryhard:  Writeln  (Report,  'very  hard'); 
hard:  Writeln  (Report,  'hard'); 
challenging:  Writeln  (report,  'challenging'); 
easy  :  Writeln  (Report,  'easy.'); 
end; 
end;  (X    Calculate_Dif f iculty_of_Vocabulary  X) 
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procedure  Calculate_Tangibility  (profile  :  profiletype; 

var  analysis  :  analysis-type); 
begin 

Writ 

Writ 

Writ 

Writ 

Writ 

Writ 

Writ 

if  ( 
i 


e 
end 
else 

i 


Writeln (Report) ;  Writeln (Report) ; 
•TANGIBILITY' :45); 
Writeln(Report) ; 

1   Number  of  Tangible  words     ='); 
profile . NumTangible) ; 

'   Number  of  Intangible  words   :'); 
profile. N urn In tangible); 
NonFiction)  then  begin 
f  (prof ile . Numlntangible  >  prof ile . NumTangible  *  4) 

Analysis . Tangibility  :=  Soft 
lse  if  (prof ile . Numlntangible  >  prof ile . NumTangible 

Analysis . Tangibility  :=  Firm 
lse  Analysis . Tangibility  :=  Solid; 


eln  (Report) 
eln  (Report, 
eln(Report) ; 
e  (Report/ 
eln  (Report, 
e  (Report, 
eln  (Report, 
Info. Genre  = 


then 

*  2)  then 


(X  genre  =  Fiction  X)  begin 
f  (prof ile . Numlntangible  >  prof ile . NumTangible  X  3)  then 

Analysis . Tangibility  :=  Soft 
lse  if  (prof ile . Numlntangible  >  prof ile . NumTangible  X  1.5)  then 

Analysis . Tangibility  :  =  Firm 
lse  Analysis . Tangibility  :=  Solid; 


eln  (Report); 

e  (Report,  'Tangibility  is 
Analysis . tangibility  of 


e 
end; 
Writ 
Writ 
case 
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end; 
end;  (x  calculate  tangibility  x) 
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.oft 
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(Report, 
( Report, 
( Report, 


); 

very  tangibible . * ) ; 

tangibible . ' ) ; 

very  intangibible . * ) ; 
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procedure  Calculate_Emotional_Tone  (profile  :  prof iletype; 

var  Analysis  :  analysistype) ; 
var 

PerCentSublime, 
PerCentPleasant, 
PerCentUnpleasant, 
PerCentHorrid, 
Emotionlndex  :  integer; 
begin 

Writeln  (Report);  Writeln( Report ) ;  Writeln( Report) ; 
Writeln  (Report,  'EMOTIONAL  CONNOTATIONS  '  :  45) ; 
Nriteln( Report) ;  Writeln( Report); 
Write  (Report,  ■   Sublime  connotations     :  '); 
Writeln  (Report,  prof i 1 e . NumSublime) ; 
Write  (Report,  *   Pleasant  connotations    :  '); 
Writeln  (Report,  prof i 1 e . Numpleasant ) ; 
Write  (Report,  '   Neutral  connotations     :  '); 
Writeln  (Report,  prof ile . NumNeutral ) ; 
Write  (Report,  '   Unpleasant  connotations  i  *); 
Writeln  (Report,  prof ile . Numunpleasant) ; 
Write  (Report,  '   Horrid  connotations      :  '); 
Writeln  (Report,  prof ile . NumHorrid) ; 
Writeln( Report ) ;  Writeln( Report ) ;  Writeln(Report) ; 
PercentSublime  :=  prof i le . numsublimeXl 00  div  prof ile . numfound; 
percentPleasant  :a  prof i le . numpleasantxiOO  div  prof ile . numfound; 
percentUnpleasant  : ~    prof ile . numunpleasantxiOO  div  prof ile . numfound; 
percenthorrid  :=  prof ile . numhorridxiOO  div  prof ile . numfound; 
Emotionlndex  :=    (percentsublime  *  5)  + 

( percentpleasant  X  2)  + 
(percentunpleasant  x  2)  + 
(percenthorrid  X  5); 
if  (Emotionlndex  >  20)  then  Analysis . emotionality  : =  rich 
else  if  (Emotionlndex  >  10)  then  Analysis . emotionality  :=  standard 
else  Analysis . emoti onali ty  :  =  poor; 

Writeln  (Report,  'Percent  of  sublime  connotations     ', PercentSublime) ; 
Writeln  (Report,  'Percent  of  pleasant  connotations    ', percentPleasant) ; 
Write    (Report,  'Percent  of  unpleasant  connotations  '); 
Writeln  (Report,   percentUnpleasant); 

Writeln  (Report,  'Percent  of  horrid  connotations      ', percenthorrid) ; 
Writeln  (Report); 

Writeln  (Report,  'Index  of  Emotionality    ',  Emotionlndex); 
Writeln  (Report); 

Write  (Report,  'Emotionality  is  '); 
case  Analysis . Emotionality   of 

Rich;  Writeln  (Report,  'Rich'); 
Standard    Writeln  (Report,  'average'); 
Poor;  Writeln  (report,  'poor'); 
end; 
if  (((percentsublime  X  5)  +percentpleasant )   > 

((percenthorrid  x  5)  +   percentunpleasant)  X  1.2)  then 
Analysis . Tone  :=  positive 
else  if  (((percentsublime  X  5)  +percentpleasant )  X  1.2  < 
((percenthorrid  X  5)  +   percentunpleasant))  then 
Analysis . Tone  :=  negative 
else  Analysis . Tone  :=  bland; 
Writeln  (Report); 
Write  (Report,  'Tone  is  '); 
case  Analysis. Tone  of 

Positive:  Writeln  (Report,  'Positive'); 

Bland:  if  (analysis . emotionality3  rich)  then  begin 

Write  (Report,  'a  balance  of  strong  positive  '); 
Writeln  (Report,  'and  strong  negative  emotions.'); 
end  else  Writeln  (Report,  'bland.'); 
negative;  Writeln  (report,  'negative'); 
end; 
end;  (x  procedure  calculate  emotional  tone  X) 
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procedure  Calculate_Vigor_of_Words  (profil 

var  an 


e  :  prof iletype; 
alysis  :  analysis-type) ; 


var 
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i  •); 


n(Report ) ; 

iv  prof ile . numfound; 

00  div  prof ile . numfound; 

f ile . numfound; 


is. Strength  :=  VeryStrong 
Analysis . Strength  :=  strong 
Analysis . Strength  :=  lively 


of  extreme  vigor 


of  much 
); 
of  some 


vigor 
vigor 


'); 
'); 
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'Index  of  Vigor 


itelnC 

ite  (R 

se  Ana 

VeryS 

strong:  Writ 
lively:  Writ 
weak    Write 
end; 
end;  (X  procedure  calculate  vigor  of  words  X) 


Strengthlndex) ; 


'Vigor  is  ' ) ; 

trength  of 

Writeln  (Report,  'very  strong1); 

eln  (Report,  'strong'); 

eln  (report,  'lively'); 

In  (Report,  'weak.'); 


procedure  Write_NumRecom  (Var  Numrecom  :  integer); 
begin 

NumRecom  :=  NumRecom  +  1; 

Wri teln( Report);  Wri teln( Report); 

Writeln  (Report,  'RECOMMENDATION  NUMBER 

Writeln (Report) ;  Wri teln( Report ) ; 
end; 


:55,  NumRecom:2); 


87 


procedure  Make_Recommendations  (analysis  :  analysistype) ; 

van  NumRecom    integer;  (x  t  of  recommendations  made  so  far  X) 

begin 

Writeln  (Report);  Writeln  (Report); 
NumRecom  : =  0; 
with  analysis  do  begin 

if  (Runons  =  Unacceptable)  then  begin 
Write_NumRecom  (NumRecom); 

Writeln  (Report,  'You  tend  to  write  run-on  sentences.'); 
Writeln  (Report,  'Check  your  longest  sentences  for  run  ons.'); 
Writeln  (Report,  'Break  them  up  into  units  of  single  ideas.'); 
end; 

if  (SentLength  =  TooLong)  and  (HardWords  >  Easy)  then  begin 
Wri te_NumRecom  (NumRecom); 

Write    (Report,  'Your  average  sentences  are  too  long  *); 
Writeln  (Report,  'for  the  difficulty  of  your  vocabulary.'); 
Writeln  (Report,  'Use  simpler  words  or  snorter  sentences.1) 
end; 

if  (Modulation  =  Bad)  then  begin 
Write_NumRecom  (NumRecom); 

Write    (Report,  'Your  sentences  tend  to  be  too  short  or  too'); 
Writeln  (Report,  'long.   Try  to  moderate  and  modulate  '); 
Writeln  (Report,  'the  length  of  your  sentences.'); 
end; 

if  (Etymology  =  TooBorrowed)  then  begin 
Write_NumRecom  (NumRecom); 

Writeln  (Report,  'Use  shorter,  more  native  English  words.1); 
end; 

if  (Emotionality  =  Poor)  then  begin 
Wri te_NumRecom  (NumRecom); 

Writeln  (Report,  'Use  more  words  that  provoke  emotions.'); 
end; 

if  (Info. Genre  =  Fiction)  then  begin 
if  (Strength  <  Strong)  then  begin 
Wri te_NumRecom  (NumRecom); 

Writeln  (Report,  'Use  more  evocative,  sensory  words.*); 
end; 

if  (Etymology  <  Mixed)  and  (Hardwords  >  Challenging)  and 
(SentLength  >  Medium)  then  begin 
Write_NumRecom  (NumRecom); 

Writeln  (Report,  'Your  fiction  reads  like  non-fiction.'); 
Writeln  (Report,  'Unless  you  are  targetting  a  highly  '); 
Writeln  (Report,  'literate  audience,  keep  it  simple.*); 
end; 

if  (Tangibility  =  Soft)  then  begin 
Write_NumRecom  (NumRecom); 

Write    (Report,  'A  narrative  should  be  concrete  and  '); 
Writeln  (Report,  'detailed.   Describe  things,  not  ideas.  '); 
end; 
end  else  begin  (X  genre  is  nonfiction  X) 

if  (Etymology  =  TooNative)  and  (Strength  =  VeryStrong)  then  begin 
Write_NumRecom  (NumRecom); 

Writeln  (Report,  'Your  non-fiction  reads  like  fiction.'); 
Write    (Report,  'Ask  yourself  if  it  is  too  vigorous  '); 
Writeln  (Report,  '  for  the  audience  that  will  read  it.'); 
end; 

if  (Strength  =  Weak)  then  begin 
Write_NumRecom  (NumRecom); 

Writeln  (Report,  'Use  more  evocative,  sensory  words.'); 
end; 
end; 
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if  ( 
( 


if  ( 


end; 
If  ( 


Strength  >  Lively)  and  (Emotionality  >  Standard)  and 

Modulation  =  Good)  then  begin 

Write  (Report,  'Congratulations!   You  write  with 

Writeln  (Report/  'strength  and  grace.  '); 

end; 

Runons  =  nonexistant)  then  begin 

Write  (Report,  'Congratulations!   You  never  seem 

Writeln  (Report,  'to  write  run-on  sentences.*); 


); 


); 


end; 
end; 
end; 


NumRecom  =  0)  then  begin 

Writeln  (Report); 

Write  (Report,  'This  is  a  solid  piece  of  writing  '); 

Writeln  (Report,  'well  within  the  traditions  of  its  genre.'); 

Writeln  (Report); 

Write  (Report,  'You  are  as  able  to  understand  the  meaning  of); 

Writeln  (Report,  '  the  above  characteristics  as  The  Stylist.'); 

Writeln  (Report,  'Stand  the  course!'); 

(x  with  analysis  do  X) 

(x  procedure  make  recommendations  X) 
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procedure  Store_New_Dix; 
begin 

Rewrite  (Dixonary,  'Dixonary  text 


); 


Page;  Writeln;  Writeln;  Writeln;  Writeln; 
Write  ('Stylist  is  now  storing  the  new  dictionary .': 55) ; 
Store_BST  (Root); 
end; 


procedure  Store_BST  (p  :  entrypointer) ; 
begin 

if  (p  <>  nil)  then  begin 

if  (p-. status  =  storable)  then  Store  (p-); 
Store_BST  (p-.left); 
Store_BST  (p-. right); 
end; 
end; 


procedure  Store  (Entry  :  entrytype); 
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s 
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case  emotion  of 
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Neutral 
Unpleasant 
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emotionltr 
emotionltr 
emotionltr 
emotionltr 


end 
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itel 
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=  • 
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^  with  entry  do  X) 

n  (Dixonary,  Entry. word,  sourceltr,  difficultyltr, 

concretenessltr,  emotionltr,  vigorltr); 
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APPENDIX  D 
THE  CODE  OF  DIXSPLIT  AND  DIXJOIN 


(x    NOTE  :  THE  FOLLOWING  IS  NOT  IN  EXECUTABLE  ORDER. 

IT  IS  IN  THE  ORDER  OF  GREATEST  CONCEPTUAL  CLARITY 


x) 


(x$S60 
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000X) 

m  Dixsplit  (input,  output); 


ent 

leng 

leng 
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urn  = 

etyp 
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cety 

icul 

rate 

iont 

rtyp 
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type 

sety 

yPoi 

ordP 
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ord 
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ourc 

iffi 

oner 

moti 

igor 

eft, 

own 

ordt 
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own, 
ext, 


=  1000; 
th  =  15; 
th  =  60; 
ngth  =  4; 
=  50; 
200; 

e  =  (Nonfiction,  Fiction); 

ytype  =  (Frequently,  Seldomly,  When_Done); 

ype  =  (Plus,  Zero,  Minus); 

type  =  (Permanent,  Temporary); 

pe  =  (Latinate,  Germanic); 

tytype  =  (PostGrad,  Grad,  High_School ,  Elementary); 

nesstype  =  (Tangible,  Intangible); 

ype  =  (Sublime,  Pleasant,  Nuetral,  Unpleasant,  Horrid); 

e  =  (Violent,  Energetic,  Calm,  Inert); 

=  packed  array  ( . 1 . . wordlength . )  of  char; 

=  packed  array  ( . 1 . . linel ength . )  of  char; 
pe  =  packed  array  ( . 1 . . phraselength . )  of  wordtype; 
nter  =  -Entrytype; 
ointer  =  -Concordtype; 
e  =  record 
:  N  o  r  d  t  y  p  e ; 
ce    Balancetype; 
ion  :  Durati ontype; 
e  :  Sourcetype; 
culty  i  Dif f icultytype; 
eteness  :  Concretenesstype; 
on  :  Enotiontype; 

:  Vigortype; 

Right    EntryPointer; 
:  Concor dp ointer; 


ype  =  record 
EntryPointer; 

Last  t    ConcordPointer; 
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var 

FourRoot, 

Root  :  EntryPointer; 

Lastward  :  ConcordPointer; 

Latinat, 

Germane, 

Tangble, 

Intngble, 

Postgrd, 

Graduate, 

HighSchl, 

El ementa , 

Sublme, 

Pleasnt, 

Nuetrl , 

Unpleasn, 

Horrd, 

Violnt, 

Energeti , 

Clm, 

Inrt, 

Dixonary  :  text; 

procedure  Attach  (Entry  :  entrytype;  var  p  :  entrypointer) ; 
begin 

New  (  p  ) ; 

with  p-  do  begin 

word  :  =  entry. word; 

duration  :=  permanent; 

source  :=  entry . source; 

difficulty  :=  entry . difficulty; 

concreteness  : =  entry . concreteness; 

emotion  :=  entry . emotion; 

vigor  :=  entry. vigor; 

left  ;=  nil;  right  :=  nil; 

down  : =  nil ; 
end; 
end; 
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procedure  Encode  (its  :  entrytype;  var  xsource  :  char; 
var  xdifficulty  :  char; 
var  xconcreteness  :  char;  var  xemotion 
var  xvigor  :  char); 


char; 


begin 

if  (its. source 


=  Latinate)  then  xsource 
else  xsource 


=  'l1 
=  »g»; 


case  its. difficulty  of 

Postgrad 

xdifficulty  : 

Grad 

xdifficulty  : 

High_School 

xdifficulty  : 

Elementary 

xdifficulty  : 

end; 

if  ( its . concreteness  =  Tangib] 

case  its. emotion  of 

Sublime 

xemotion 

3   1 

Pleasant 

xemotion 

=   1 

Nuetral 

xemotion 

-        1 

Unpleasant 

xemotion 

=   1 

Horrid 

xemotion 

=   1 

end; 

case  its. vigor  c 

>f 

Violent 

xvigor 

=  «vi 

Energetic 

xvigor 

=  'e' 

Calm 

xvigor 

=  'c1 

Inert 

xvigor 

=  'i« 

end; 

end; 

procedure  Decode  < 

var  its 

entr 

•g« 

•h' 
»e' 


Tangible)  then  Xconcreteness 
else  Xconcreteness 


=  ft» 
=  i-i  t 


begin 

if  (Xsource 


xsource  :  char;  xdifficulty  :  char; 
xconcreteness  :  char;  xemotion  :  char; 
xvigor  :  char); 


1') 


then 
else 
case  Xdifficulty  of 

its . difficulty 
its . difficulty 
its . difficulty 
its . difficulty 


its . source 
its . source 


=  Latinate 
=  Germanic; 


Postgrad; 
Grad; 

High_School ; 
Elementary; 


end; 

if  (Xconcreteness3  't')  then  its . concreteness 

else  its . concreteness 


case  Xem 
•s 
'P 


•n" 

fuf 

•h' 
end; 
case  Xvi 


end; 
end; 


otion  of 
its . emotion 
its . emotion 
its . emotion 
its . emotion 
its . emotion 

gor  of 
its .vigor 
its . vigor 
its. vigor 
its.  vigor 


=  Tangible 
:  Intangible; 


=  Sublime; 

=  Pleasant; 

=  Nuetral; 

=  Unpleasant; 

=  Horrid; 


:  Violent; 
:  Energetic; 
:  Calm; 
=  Inert; 
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procedure  AVL_Insert  (Entry  :  entrytype;  var  p  :  Entrypointer; 

var  balanced  :  boolean); 
var 

pl,  p2  :  entrypointer; 
begin 

if  (p  =  nil)  then  begin 
Attach  (Entry,  p); 
p- . balance  :  =  Zero; 
balanced  :=  true; 
end 

else  if  (Entry. word  =  p-.word)  then  Writeln  ('INSERT  COLLISION1) 
else  if   (Entry. word  <  p-.word)  then  begin 
AVL_Insert  (Entry,  p-.left,  balanced); 
If  Balanced  then  (x  left  pointer  has  grown  higher  x) 
case  p-. balance  of 

Plus  :  begin  p-. balance  :=  Zero;  balanced  : =  false;  end; 
Zero    p-. balance  :=  Minus; 
Minus  ;  begin  (x  rebalance  X) 
Pi  :=  P-.left; 

if  (pi-. balance  =  Minus)  then  begin(x  single  LL  rotate) 
p-.left  :=  pi-. right; 
pl-. right  :=  p; 
p- . balance  :=  Zero; 
P  i=  pl; 
end  (X  if  X) 

else  begin  (X  double  LR  rotation  X) 
p2  : =  pl- . right; 
pl-. right  •.  =  p2-.left; 
p2-.left  :=  pl; 
p-.left  :=  p2- . right; 
p2- . right  : =  p; 

if  (p2-. balance  =  Minus)  then  p- . balance  :=  Plus 
else  p-. balance  :  =  Zero; 

if  (p2-. balance  =  Plus)  then  pl-. balance  : =  Minus 
else  pl-. balance  :=  Zero; 
p  ==  p2; 
end;  (X  else  X) 

p- . balance  :=  Zero;  balanced  : =  false; 
end;  (X  case  of  Minus  X) 
end;  (X  of  cases  X) 
end  (X  if  Entry. word  <   p-.word  X) 
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else  if  (  Entry. word  >  p-.word)  then  begin 

AVL_Insert  (Entry,  p- . right,  balanced); 
if  balanced  then  (X  right  pointer  has  grown  higher  X) 
case  p- . balance  of 

Minus  :  begin  p-. balance  : =  Zero; 

balanced  :=  false;  end; 
Zero  ;  p- . balance  :=  Plus; 
Plus  •.  begin  (x  rebalance  x) 
pl  :=  p-. right; 

if  ( pl" . balance  =  Plus)  then  begin 
(x  single  RR  *) 
p- . right  :=  pl-.left; 
pl-.left  :=  p; 
P-. balance  :=  Zero; 
p  :=  pl; 
end  (X  if  X) 

else  begin  (X  double  RL  rotation  X) 
p2  :=  pl-.left; 
pl-.left  :=  p2-. right; 
p2- . right  :=  pl; 
p- . right  ==  p2-.left; 
p2-.left  :=  p; 
if  (p2-. balance  =  Plus)  then 

p- . balance  :=  Minus 
else  p- . balance  :=  Zero; 
if  (p2-. balance  =  Minus)  then 

pl-. balance  : ~    Plus 
else  pl-. balance  :=  Zero; 
P  ==  p2; 
end;  (x  double  RL  rotation  x) 
p- . balance  :  =  Zero;  balanced  :=  false; 
end;  (x  case  of  balance  =  Plus  X) 
end;  (x  of  cases  x) 
end   (x  of  if  Entry. word  >  p-.word  X) 
else  balanced  :=  false; 
end;  (X  of  procedure  AVL  Insert  X) 
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procedure  Init_Researcher; 
var 
i , 

stepcount, 
threshhold, 
dixlength  :  integer; 
s,  d,  c,  e,  v    char; 
entry  :  entrytype; 
balanced    boolean; 
begi  n 

Reset  (Dixonary,  'Dixonary  text  a1); 

New  (Root); 

Readln  (Dixonary,  Root^.word,  s,  d,  c,  e,  v); 

Decode  (Root^,  s,  d,  c,  e,  v); 

Root^.left  ;=  nil;  Root^. right  ■•  =  nil; 

Root^ . down  : =  nil ; 

Root^. balance  :=  Zero; 

Root^ . duration  :=  Permanent; 

Lastward  : =  nil ; 

Stepcount  : =  0 ; 

while  not  EOF  (Dixonary)  do  begin 

Readln  (Dixonary,  Entry. word,  s,  d,  c,  e,  v); 
Decode  (Entry,  s,  d,  c,  e,  v); 
Entry . balance  :=  Zero;  balanced  t-    false; 
AVL_Insert  (Entry,  Root,  balanced); 
stepcount  :=  stepcount  +  1; 
if  (stepcount  mod  200  =  0)  then  begin 

Page;  Writeln;  Writeln;  Writeln;  Writeln; 
Writeln  ('Stylist  is  now  loading  its  dictionary.1); 
Writeln  ('Last  word  loaded  was  ',  entry .word : 16 ) ;  Writeln; 
Writeln  ('Stylist  has  loaded  ', stepcount : 5, '  entries.1); 
Writeln;  Writeln; 
end;  (*  if  *) 
end;  (*  while  not  EOF  x) 
end; 
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procedure  Inorder  (var  Root  :  entrypointer ) ; 
begin 

if  (Root  <>  nil)  then  begin 
Inorder  (Root- . left ) ; 
case  Root-. Source  of 

Latinate  :  Writeln  (Latinat,  Root-. word,  ,1,:1); 

Germanic  :  Writeln  (Germane,  Root-. word,  'g'tl); 
end; 
case  Root- . Concreteness  of 

Tangible  :  Writeln  (Tangble,  Root-. word,  't':l); 

Intangible  :  Writeln  (InTngble,  Root-. word,  'i'rl); 
end; 
case  Root- . difficulty  of 

Postgrad  :  Writeln  (Postgrd,  Root-. word,  'p':D; 

Grad  :  Writeln  (Graduate,  Root-. word,  'g'tl); 

High_School  :  Writeln  (HighSchl,  Root-. word,  'h':l); 

Elementary  :  Writeln  (Elementa,  Root-. word,  'e'sl); 
end; 
case  Root- . emotion  of 

Sublime  :  Writeln  (Sublme,  Root-. word,  's':l); 

Pleasant  :  Writeln  (Pleasnt,  Root-. word,  'p':D; 

Nuetral  :  Writeln  (Nuetrl,  Root-. word,  *n*:l); 

Unpleasant  :  Writeln  (Unpleasn,  Root-. word,  'u':l); 

Horrid  :  Writeln  (Horrd,  Root-. word,  'h1:!); 
end; 
case  Root-. vigor  of 

Violent  :  Writeln  (Violnt,  Root-. word,  'v^l); 

Energetic  :  Writeln  (Energeti,  Root-. word,  *e'); 

Calm  :  Writeln  (Clm,  Root-. word,  ' c ' ) ; 

Inert  :  Writeln  (Inrt,  Root-. word,  'i'); 
end; 

Inorder  (Root- . right) ; 
end; 
end; 
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begin  (x  main  program 
Init  Researcher; 


X) 


Rewrite 
Rewrite 
Rewrite 
Rewrite 


Rew 
Rew 
Rew 
Rew 
Rew 
Rew 
Rew 
Rew 
Rew 
Rew 
Rew 
Rew 
Rew 
Ino 
end  . 


rite 
rite 
rite 
rite 
rite 
rite 
rite 
rite 
rite 
rite 
rite 
rite 
rite 
rder 


Latinat, 
Germane, 
Tangble, 


•Latinat 
'Germane 
'Tangble 


text 
text 

text  a'); 


); 
); 


Intngble,  'Intngble  text  a1); 


Postg 

Gradu 

HighS 

El  erne 

Sublm 

Pleas 

Nuetr 

Unple 

Horrd 

Violn 

Energ 

Clm, 

Inrt, 

Root) 


rd,  ' 

ate. 

chl, 

nta , 

e,  'S 

nt,  • 

1,  'N 

asn, 

,  'Ho 

t,  'V 

eti, 
Clm 
•Inr 


Post 

•Gra 

•Hig 

•Ele 

ublm 

Plea 

uetr 

•Unp 

rrd 

ioln 

•Ene 

text 

t  te 


grd  text  a'); 
duate  text  a1); 
hSchl  text  a'); 
menta  text  a'); 
e  text  a ' ) ; 
snt  text  a ' ) ; 
1  text  a' ); 
leasn  text  a'); 
text  a1); 
t  text  a' ); 
rgeti  text  a'); 
a'); 


xt  a1); 
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(x$S60000*) 

program  Dixjoin   (input,  output); 

const 

maxsent  =  1000; 

wordlength  =  15; 

linelength  =  60; 

phraselength  =  4; 

freqnum  =  50; 

selnum  =  200; 

type 

Genretype  =  (Nonfiction,  Fiction); 

Frequencytype  =  (Frequently,  Seldomly,  When_Done); 

Balancetype  =  (Plus,  Zero,  Minus); 

Durationtype  =  (Permanent,  Temporary); 

Sourcetype  =  (Latinate,  Germanic); 

Dif f icul tytype  =  (PostGrad,  Grad,  High_School,  Elementary); 

Concretenesstype  =  (Tangible,  Intangible); 

Emotiontype  =  (Sublime,  Pleasant,  Nuetral,  Unpleasant,  Horrid); 

Vigortype  =  (Violent,  Energetic,  Calm,  Inert); 

Wordtype  =  packed  array  (. 1 . .wordlength . )  of  char; 
Linetype  =  packed  array  (. 1 .. linelength . )  of  char; 
Phrasetype  =  packed  array  (. 1 .. phraselength . )  of  wordtype; 

EntryPointer  =  -Entrytype; 
ConcordPointer  =  -Concordtype; 

Entrytype  =  record 

Word    Wordtype; 

Balance  :  Balancetype; 

Duration  :  Durationtype; 

Source  :  Sourcetype; 

Difficulty  :  Dif f icultytype; 

Concreteness  :  Concretenesstype; 

Emotion    Emotiontype; 

Vigor  :  Vigortype; 

Left,  Right  :  EntryPointer; 

Down    Concordpointer; 
end; 

Concordtype  =  record 

Up  :  EntryPointer; 

Down, 

Next,  Last  :  ConcordPointer; 
end; 
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var 

FourRoot , 

Root  :  EntryPointer; 

Lastward  :  ConcordPointer; 

Latinat, 

Gernanc, 

Tangble, 

Intngble, 

Postgrd, 

Graduate, 

HighSchl, 

Elementa , 

Sublme, 

Pleasnt, 

Nuetrl, 

Unpleasn, 

Horrd, 

Violnt, 

Enargeti , 

Clm, 

Inrt, 

Dixonary  :  text; 

word  :  wordtype; 

P    entrypointer; 

c  :  char; 

procedure  Attach  (Entry  :  entrytype;  var  p  :  entrypointer); 
begin 

New  ( p) ; 

with  p-  do  begin 

word  ;=  entry. word; 

duration  s=  permanent; 

source  : =  entry . source; 

difficulty  :=  entry . difficulty; 

concreteness  :=  entry . concreteness; 

emotion  :=  entry . emotion; 

vigor  :=  entry. vigor; 

left  :=  nil;  right  :=  nil; 

down  : =  nil ; 
end; 
end; 
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procedure  Encode  (its  :  entrytype;  var  xsource  i  char; 

var  xdifficulty  :  char; 

var  xconcreteness  :  char;  var  xemotion 

var  xvigor  :  char); 


char; 


begin 

if  (its. source 

case  its.diffic 
Postgrad 
Grad 

High_School 
Elementary 

end; 

if  (its.concret 

case  its.emotio 
Sublime 
Pleasant 
Nuetral 
Unpleasant 
Horrid 

end; 

case  its. vigor 
Violent 
Energetic 
Calm 
Inert 

end; 
end; 


=  Latinate)  then 

xsource 

else 

xsource 

ulty  of 

;  xdifficulty 

= 

•p'; 

:  xdifficulty 

a 

•g'; 

:  xdifficulty 

= 

•h'; 

:  xdifficulty 

= 

'e'; 

=  '1 

=   «a 


eness  =  Tangible)  then  Xconcreteness 

else  Xconcreteness 
n  of 

xemotion 

xemotion  s  =  ■ p1  ; 

xemotion 

xemotion 

xemotion 


=  't' 
=  tit 


of 


xvigor 
xvigor 
xvigor 
xvigor 


=  *v  ' 

=  'e1 

=  'c' 

=  mi 


procedure  Decode  (var  its  :  entrytype; 

xsource  :  char;  xdifficulty  :  char; 
xconcreteness  :  char;  xemotion  :  char; 
xvigor  :  char); 


begin 

if  (Xsource  =  *1')  then  its. source 

else  its. source 
case  Xdifficulty  of 

its . difficulty 
its . difficulty 
its . difficulty 
its . difficulty 


Latinate 
Germanic; 


'h' 


end; 

if  (Xconcreteness1 


=  Postgrad; 

=  Grad; 

=  High_School; 

=  Elementary; 


t')  then  i ts . concreteness 
else  its . concreteness 


end 
case 


end 
end; 


case  Xemotion  of 

its . emotion 
its . emotion 
its . emotion 
its . emotion 
its . emotion 


=  Tangible 
:  Intangible; 


h' 

Xvigor 

its 

its 

its 

its 


of 

.  vigor 

.vigor 

.vigor 

.  vigor 


=  Sublime; 

=  Pleasant; 

=  Nuetral; 

=  Unpleasant; 

=  Horrid; 


Violent; 
Energetic; 
Calm; 
:  Inert; 
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procedure  AVl_Insert  (Entry  :  entrytype;  var  p  :  Entrypointer; 

var  balanced  :  boolean); 
var 

pl,  p2  ;  entrypointer; 
begin 

if  (p  =  nil)  then  begin 
Attach  (Entry,  p); 
p-'. balance  :=  Zero; 
balanced  : =  true; 
end 

elsa  if  (Entry. word  =  p-.word)  then  Writeln  ('INSERT  COLLISION') 
else  if   (Entry. word  <  p-.word)  then  begin 
AVL_Insert  (Entry,  p-.left,  balanced); 
If  Balanced  then  (X  left  pointer  has  grown  higher  X) 
case  p-. balance  of 

Plus  :  begin  p- . balance  :  =  Zero;  balanced  : =  false;  end; 
Zero  :  p-. balance  ==  Minus; 
Minus    begin  (x  rebalance  x) 
Pl  :=  P-.left; 

if  (pl-. balance  =  Minus)  then  begin(X  single  LL  rotatx) 
p-.left  :=  pl-. right; 
pl- . right  : =  p; 
p- . balance  t-    Zero; 
p  i =  pl; 
end  (X  if  X) 

else  begin  (X  double  LR  rotation  X) 
p2  : =  pl- . right; 
pl-. right  :=  p2-.left; 
p2-.left  :=  pl; 
p-.left  :=  p2-. right; 
p2- . right  :=  p; 

if  (p2-. balance  =  Minus)  then  p- . balance  :=  Plus 
else  p- . balance  :=  Zero; 

if  (p2-. balance  =  Plus)  then  pl-. balance  :=  Minus 
else  pl-. balance  :=  Zero; 
p  • =  p2; 
end;  (x  else  X) 

p- . balance  :=  Zero;  balanced  :=  false; 
end;  (X  case  of  Minus  X) 
end;  (x  of  cases  X) 
end  (X  if  Entry. word  <   p-.word  X) 
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else  if  (  Entry. word  >  p-.word)  then  begin 

AVL_Insert  (Entry,  p- . right,  balanced); 
if  balanced  then  (*  right  pointer  has  grown  higher  x) 
case  p- . balance  of 

Minus  ;  begin  p- . balance  :=  Zero; 

balanced  :=  false;  end; 
Zero    p- . balance  :=  Plus; 
Plus  =  begin  (x  rebalance  x) 
pi  :=  p- . right; 

if  (pi-. balance  =  Plus)  then  begin 
(x  single  RR  *) 
P- . right  :=  pi-. left; 
pl-.left  :=  p; 
p- . balance  :=  Zero; 
p  : =  pi; 
end  (X  if  X) 

else  begin  (X  double  RL  rotation  x) 
p2  :=  pl-.left; 
pl-.left  :=  p2- . right; 
p2-. right  :=  pi; 
p- .  right  ••  =  p2-.left; 
P2-.left  ••=  p; 
if  (p2-. balance  =  Plus)  then 

p- . balance  :  =  Minus 
else  p- . balance  ;=  Zero; 
if  (p2-. balance  =  Minus)  then 

pi-. balance  :=  Plus 
else  pi-. balance  ••-    Zero; 
P  ==  p2; 
end;  (X  double  RL  rotation  X) 
p- .  balance  :  =  Zero;  balanced  ••-    false; 
end;  (X  case  of  balance  =  Plus  *) 
end;  (X  of  cases  X) 
end   (X  of  if  Entry. word  >  p-.word  x) 
else  balanced  :=  false; 
end;  (x  of  procedure  AVL_Insert  X) 


!. 
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procedure  Init_Researcher; 
var 
i , 

stepcount, 
threshhold, 
dixlength  •■    integer; 
s,  d,  c,  e,  v  :  char; 
entry  :  entrytype; 
balanced  :  boolean; 
begin 

Reset  (Dixonary,  'Dixonary  text  a1); 

New  (Root); 

Readln  (Dixonary,  Root-. word,  s,  d,  c,  e,  v); 

Decode  (Root-,  s,  d,  c,  e,  v); 

Root-. left  :=  nil;  Root-. right  ••-    nil; 

Root- . down  : =  ni 1 ; 

Root- . balance  :=  Zero; 

Root- . duration  :=  Permanent; 

Lastward  : =  nil ; 

Stepcount  : s  0 ; 

while  not  EOF  (Dixonary)  do  begin 

Readln  (Dixonary,  Entry. word,  s,  d,  c,  e,  v); 
Decode  (Entry,  s,  d,  c,  e,  v); 
Entry . balance  i=  Zero;  balanced  :=  false; 
AVL_Insert  (Entry,  Root,  balanced); 
stepcount  :=  stepcount  +  1; 
if  (stepcount  mod  200  =  0)  then  begin 

Page;  Hriteln;  Writeln;  Writeln;  Writeln; 
Writeln  ('Stylist  is  now  loading  its  dictionary.1); 
Writeln  ('Last  word  loaded  was  ',  entry .word : 16 ) ;  Writeln; 
Writeln  ('Stylist  has  loaded  *, stepcount : 5, '  entries.'); 
Writeln;  Writeln; 
end;  (*  if  X) 
end;  (x  while  not  EOF  x) 
end; 

procedure  Inorder  (var  Root    entrypointer; 

word  :  wordtype; 
var  P  :  entrypointer); 

begin 

if  (Root-. word  =  word)  then  p  :=  root 

else  if  (root-. word  >  word)  then  Inorder  (Root-. left,  word,  p) 

else  Inorder  (Root- . right ,  word,  p); 
end; 


procedure  Store_BST  (p  :  entrypointer); 

var  s,  d,  c,  e,  v  :  char; 

begin 

if  (p  <>  nil)  then  begin 

Encode  (p-,  s,  d,  c,  e,  v); 

Writeln  (Dixonary,  p-.word,  s,  d,  c,  e,  v); 

Store_BST  (p-.left); 

Store_BST  (p-. right); 
end; 
end; 


procedure  Store_New_Dix; 
begin 

Rewrite  (Dixonary,  'Dixonary  text  a*); 

Store_BST  (Root); 
end; 
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begin  (X  main  program  X) 

Init_Researcher; 

Reset  (Latinat,  'Latinat  text  a'); 
While  not  EOF  (Latinat)  do  begin 

Readln  (Latinat,  word,  c); 

if  (  c  <>  '1')  then  begin  Inorder 

P- . source  :=  germanic;   end; 
end; 

Reset  (Germane,  'Germane  text  a'); 
While  not  EOF  (Germane)  do  begin 

Readln  (Germane,  word,  c); 

if  (  c  <>  'g')  then  begin  Inorder 

p- . source  :=  latinate;  end; 
end; 

Reset  (Tangble,  'Tangble  text  a'); 

While  not  EOF  (Tangble)  do  begin 
Readln  (Tangble,  word,  c); 
if  (  c  <>  *t')  then  begin  Inorder  (root, 
P^ . concreteness  :=  intangible;  end; 

end; 

Reset  (Intngble,  'Intngble  text  a1); 


(root,  word,  p); 


(root,  word,  p); 


word,  p); 


While  not 
Readln 
if  (  c 


EOF  (InTngble)  do  begin 
(Intngble,  word,  c); 


<> 


i')  then  begin  Inorder  (root,  word,  p); 


p- . cone  ret en ess 
end; 


=  tangible;  end; 


Reset  (Postgrd,  'Postgrd  text  a'); 

While  not 
Readln 


if 
if 
if 
if 
end; 


(  c 
(c  = 
(c  = 
(c  = 


EOF  (Postgrd)  do  begin 
(Postgrd,  word,  c); 


<> 


«gl  ) 

'h') 

le.) 


P1)  then  begin  Inorder  (root,  word,  p); 


then  p~ . difficulty 
then  p- . difficulty 
then  p- . difficulty 


=  grad; 

=  high_school; 

=  elementary;  end; 


Reset  (Graduate,  'Graduate  text  a1); 
While  not  EOF  (Graduate)  do  begin 
Readln  (Graduate,  word,  c); 


if 
if 
if 
if 
end; 


(  c  <>  'g')  then  begin  Inorder 
(c  =  *p')  then  p- . difficulty 
(c  =  'h')  then  p- . difficulty 


(c  =  'e')  then  p- . difficulty 


(root,  word,  p); 
=  postgrad; 
=  high_school; 
=  elementary;  end; 


Reset  (HighSchl,  'HighSchl  text  a' 

While  not  EOF  (HighSchl)  do  begin 

Readln  (HighSchl,  word,  c); 


); 


if 

(    c 

if 

(c    = 

if 

(c    = 

if 

(c    = 

end; 

<>  'h')  then  begin  Inorder  (root,  word,  p); 


p')  then  p- . difficulty 
g')  then  p- . difficulty 
e')  then  p- . difficulty 


=  postgrad; 

=  grad; 

=  elementary; 


end; 


Reset  (Elementa,  'Elementa  text  a 

While  not  EOF  (Elementa)  do  begin 

Readln  (Elementa,  word,  c); 


if 
if 
if 
if 
end; 


(  c  <> 

(c  =  ' 
(c  = 
(c  = 


e1)  then  begin  Inorder  (root,  word,  p); 


tgt) 

'h') 


then  p- . difficulty 
then  p- . difficulty 
then  p- . difficulty 


=  postgrad; 

=  grad; 

=  high_school; 


end; 
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Reset 
While  not 

Readln 

if  (  c 

if 

if 

if 

if 
end; 


(Sublme,  'Sublme  text  a'); 


EOF  (Sublme)  do  begin 
(Sublme,  word,  c); 


<> 


(c 
(c 
(c 
(c 


s')  then  begin  Inorder  (root,  word,  p); 


P') 
n') 
u«) 
h») 


then 
then 
then 
then 


P^ 
P- 
P-" 
P- 


emotion 
emotion 
emotion 
emotion 


pleasant; 
nuetral ; 
unpleasant; 
horrid;  end; 


Reset  (Pleasnt,  'Pleasnt  text  a'); 
While  not  EOF  (Pleasnt)  do  begin 
Readln  (Pleasnt,  word,  c); 


if 
if 
if 
if 
if 


(  c 

(c 

(c 

(c 

(c 


<>  'p 
=  's') 
=  »n') 
=  'u') 
=  'h') 


)  then  begin  Inorder  (root,  word,  p); 


then 
then 
then 
then 


P~ 
P~ 
P~ 
P- 


emoti  on 
emotion 
emotion 
emotion 


sublime; 
nuetral ; 
unpleasant; 
horrid;   end; 


ena. 


Reset  (Nuetrl,  'Nuetrl  text  a') 
While  not 
Readln 


EOF  (Nuetrl)  do  begin 
(Nuetrl,  word,  c); 


if 
if 
if 
if 
if 
end; 


(  c 

(c  ■ 

(C  : 

(c  ■■ 

(C   : 


<> 


n')  then  begin  Inorder  (root,  word,  p); 


's') 


h') 


then 
then 
then 
then 


P^ 
P~ 
P- 

P~ 


emotion 
emotion 
emotion 
emotion 


sublime; 
pleasant; 
unpleasant; 
horrid;  end; 


Reset 
While  not 

Readln 

if  (  c 

if 

if 

if 

if 
end; 


(Unpleasn,  'Unpleasn  text  a'); 


(c 
(c 
(c 
(c 


EOF  (Unpleasn)  do  begin 

(Unpleasn,  word,  c); 

<>  'u')  then  begin  Inorder  (root,  word,  p); 


s') 
P') 
n') 
h») 


then 
then 
then 
then 


P- 
P^ 
P^ 

P~ 


emotion 
emotion 
emotion 
emotion 


sublime; 
pleasant; 
nuetral ; 
horrid;  end; 


Reset  (Horrd,  'Horrd  text  a'); 

While  not  EOF  (Horrd)  do  begin 

Readln  (Horrd,  word,  c); 


if 
if 
if 
if 
if 
end; 


(  c 

(c  : 
(c  ■ 
(c  ■ 

(C   : 


<> 


h 
s1) 
P') 
n') 

u«) 


)  then  begin  Inorder  (root,  word,  p); 


then 
then 
then 
then 


P~ 
P- 
P~ 
P^ 


emotion 
emotion 
emotion 
emotion 


sublime; 
pleasant; 
nuetral ; 
unpleasant; 


end; 


Reset 
While  not 

Readln 

if  (  c 

if 

if 

if 
end; 


(Violnt,  'Violnt  text  a'); 


(c  = 
(c  = 
(c  = 


EOF  (Violnt)  do  begin 
(Violnt,  word,  c); 
<>  'v')  then  begin  Inorder 
•e') 


then  p-^. vigor 
c  •  )  then  p- .vigor 
i')  then  p- . vigor 


( root, 
=  energetic; 
=  calm; 
=  inert;  end; 


word,  p); 


Reset  (Energeti,  'Energeti  text  a*); 

While  not 
Readin 


EOF  (Energeti)  do  begin 
(Energeti,  word,  c); 


IT 

if 
if 

if 
end; 


(  c  <>  'e')  then  begin  Inorder  (root,  word,  p); 

(c  = 
(c  = 
(c  = 


) 

c1) 
i1) 


then  p- . vigor 
then  p- . vigor 
then  p- . vigor 


=  violent; 

=  calm; 

=  inert;  end; 
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Reset 
While  not 

Readln 

if  (  c 

if 

if 

if 
end; 


(Clm,  »Clm  text  a1) 


(c  = 
(c  = 
(c  = 


EOF  (Clm)  do  begin 

(Clm,  word,  c); 

<>  'c')  then  begin  Inorder  (root,  word,  p); 


v ' )  then  p- .vigor 
ef)  then  p-. vigor 
i1)  then  p- . vigor 


violent; 
=  energetic; 
=  inert;  end; 


Reset  (Inrt,  'Inrt  text  a'); 

While  not 
Readln 


if 

if 
if 
if 
end; 


(  c 
(c  = 

(c  = 
(c  = 


EOF  (Inrt)  do  begin 

(Inrt,  word,  c); 

<>  *i')  then  begin  Inorder 


(root,  word,  p); 


•v«) 
•ef  ) 

■c') 


then  p- . vigor 
then  p^. vigor 
then  p- . vigor 


=  violent; 
=  energetic; 
=  calm;  end; 


Store_New_Dix; 


end 
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Plato's  Phaedrus 


APPENDIX  E 
REPRESENTATIVE  RUNS 


PROFILE 


Total  of  sentences 
Total  of  words 
Total  of  letters 


17 

360 
1595 


360  words  are  too  few  for  valid  statistical  analysis 


Of  the     360  words   in  Plato's  Phaedrus 

312  were  matched  to  words  in  the  Stylist  dictionary. 

86  '/.      were  matched. 

This  is  enough  for  statistical  analysis. 
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BREAKDOWN  OF  PERCENT  OF  LETTERS  PER  WORD 


30 

23 

26 

24 

22 

20 

18 

16 

14 

12 

10 

8 

6 

4 

2 


X 

• 

• 

X 

X 

X 

X 

X 

X 

X 

X 

X 

X 

X 

X 

X 

X 

X 

X 

X 

X 

X 

X 

X 

• 

X 

X 

X 

X 

X 

X 

X 

X 

X 

X 

X 

X 

X 

X 

X 

X 

X 

X 

X 

X 

X 

X 

X 

3 

4 

5 

6 

7 

8 

9 

10 

11    12    13    14    15 


Number  of  Letters  in  a  Word 

The  average  number  of  letters  per  word       6 . 131428571E+00 

A  typical  nonfiction  texts  distribution 

resembles  a  low  bell-shaped  curve  centered  around  six  letters/word, 

The  length  of  the  words  is  medium 
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1 

BRE 

Mum 

ber 

of  words 

100 

98 

96 

94 

92 

90 

88 

36 

84 

82 

80 

78 

76 

74 

72 

70 

68 

66 

64 

62 
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56 

54 

52 
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46 

44 

42 

40 
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38 
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36 
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34 
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32 
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XX 

30 

X 

XX 

28 
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XX   X 

26 

X 

XX   X 

24 
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X  X 

XX   X 

22 

X 

XXXX 

XX   X 

20 

X 

xxxx 

XX   X 

13 

X 

xxxxx 

XXX  X 

16 

XXXXXXXX 

XXX  X 

14 

xxxxxxxx 

XXX  X 

12 

xxxxxxxxxxxxxxx  x 

10 

xxxxxxxxxxxxxxxxx 

3 

xxxxxxxxxxxxxxxxx 

6 

xxxxxxxxxxxxxxxxx 

4 

xxxxxxxxxxxxxxxxx 

2 

xxxxxxxxxxxxxxxxx 

10         20 

BREAKDOWN  OF  NUMBER  OF  WORDS  PER  SENTENCE 


30         40         50        60        70 
Sentence  Number 

The  average  number  of  words  per  sentence       :   2 . 117647059E+01 

A  typical  modern  texts  sentences  average   between  fifteen  and  twenty  words 

Sentences  are  long. 

Number  of  run  ons  0 
Percent  of  run  ons  0 
Run  ons  are  nonexistant. 
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Number  of  medium  length  sentences      9 
Percent  of  medium  length  sentences    52 


Modulation  is   good 


ETYMOLOGY  OF  WORDS 


Number  of  Latinate  words 
Number  of  Germanic  words 

Etymology  is  very  native. 


51 
95 


DIFFICULTY  OF  VOCABULARY 


PostGraduate  difficulty 
Graduate  difficulty 
High  School  difficulty 
Elementary  difficulty 


0 

1 

25 

120 


Percent  of  Postgraduate  difficulty 
Percent  of  Graduate  difficulty 
Percent  of  High  School  difficulty 


Difficulty  is  easy. 


TANGIBILITY 


Number  of  Tangible  words 
Number  of  Intangible  words 

Tangibility  is   tangibible. 


35 
111 


EMOTIONAL  CONNOTATIONS 


Sublime  connotations 
Pleasant  connotations 
Neutral  connotations 
Unpleasant  connotations 
Horrid  connotations 


7 

44 

87 

8 

0 


Percent  of  sublime  connotations 

Percent  of  pleasant  connotations 

Percent  of  unpleasant  connotations 

Percent  of  horrid  connotations 

Index  of  Emotionality  42 

Emotionality  is  Rich 

Tone  is  Positive 


2 

14 
2 
0 
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VIGOR  OF  WORDS 


Words  of  Extreme  Vigor 

Words  of  Much  Vigor 

Words  of  Some  Vigor 

Words  of  Little  Vigor 


0 

17 
75 
54 


Percent  of  words  of  extreme  vigor 
Percent  of  words  of  much  vigor 
Percent  of  words  of  some  vigor 


Index  of  Vigor 
Vigor  is  lively 


49 


0 

5 

24 


Congratulations!   You  never  seem  to  write  run  on  sentences. 

This  is  a  solid  piece  of  writing  well  within  the  traditions  of  its  genre 
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CONCORDANCE 
Phaedrus 

them  and  thamus  enquired    ABOUT    their  several  uses  and 

discovered  is  not  an     AID    to  memory  but  to 

other  egyptians  might  be    ALLOWED    to  have  the  benefit 

an  art  is  not    ALWAYS    the  best  judge  of 

is  called  by  them    AMMON    to  him  came  theuth 

learned  nothing  they  will     APPEAR    to  be  omniscient  and 

censured  others  as  he     APPROVED    or  disapproved  of  them 

many  arts  such  as     ARITHMETIC    and  calculation  and  geometry 

or  inventor  of  an     ART    is  not  always  the 

the  inventor  of  many     ARTS    such  as  arithmetic  and 

blame  of  the  various     ARTS    but  when  they  came 

calculation  and  geometry  and     ASTRONOMY    and  draughts  and  dice 

have  been  led  to     ATTRIBUTE    to  them  a  quality 

in  the  learners'  souls     BECAUSE    they  will  not  use 

your  own  children  have     BEEN    led  to  attribute  to 

allowed  to  have  the     BENEFIT    of  them  he  enumerated 

is  not  always  the    BEST    judge  of  the  utility 

wiser  and  give  them     BETTER    memories  it  is  a 

name  was  theuth  the    BIRD   which  is  called  the 

theuth  in  praise  or     BLAME    of  the  various  arts 

it  is  a  specific     BOTH    for  the  memory  and 

such  as  arithmetic  and     CALCULATION    and  geometry  and  astronomy 

egypt  which  the  hellenes     CALL    egyptian  thebes  and  the 

the  bird  which  is     CALLED    the  ibis  is  sacred 

the  god  himself  is     CALLED    by  them  ammon  to 

them  ammon  to  him     CAME    theuth  and  showed  his 

arts  but  when  they     CAME    to  letters  this  said 

a  quality  which  they    CANNOT    have  for  this  discovery 

some  of  them  and     CENSURED    others  as  he  approved 

to  the  external  written     CHARACTERS    and  not  remember  of 

love  of  your  own     CHILDREN    have  been  led  to 

socrates  at  the  egyptian     CITY    of  naucratis  there  was 

dwelt  in  that  great     CITY    of  upper  egypt  which 

they  will  be  tiresome     COMPANY    having  the  show  of 

king  of  the  whole     COUNTRY    of  egypt  and  he 

discovery  of  yours  will     CREATE    f orgetf ulness  in  the  learners' 

letters  now  is  those     DAYS    the  god  thamus  was 

and  showed  his  inventions     DESIRING    that  the  other  egyptians 

astronomy  and  draughts  and     DICE    but  his  great  discovery 

as  he  approved  or     DISAPPROVED    of  them  it  would 

and  you  give  your     DISCIPLES    not  truth  but  only 

specific  which  you  have    DISCOVERED    is  not  an  aid 

dice  but  his  great     DISCOVERY   was  the  use  of 

cannot  have  for  this     DISCOVERY    of  yours  will  create 

geometry  and  astronomy  and     DRAUGHTS    and  dice  but  his 

of  egypt  and  he     DWELT    in  that  great  city 

the  whole  country  of     EGYPT    and  he  dwelt  in 

great  city  of  upper     EGYPT    which  the  hellenes  call 

socrates  at  the     EGYPTIAN    city  of  naucratis  there 

which  the  hellenes  call     EGYPTIAN    thebes  and  the  god 

desiring  that  the  other    EGYPTIANS    might  be  allowed  to 

theuth  will  make  the     EGYPTIANS    wiser  and  give  them 

enumerated  them  and  thamus     ENQUIRED    about  their  several  uses 

benefit  of  them  he     ENUMERATED    them  and  thamus  enquired 

will  trust  to  the     EXTERNAL    written  characters  and  not 

naucratis  there  was  a     FAMOUS    old  god  whose  name 

you  who  are  the     FATHER    of  letters  from  the 

of  yours  will  create    FORGETFULNESS    in  the  learners'  souls 

be  omniscient  and  will     GENERALLY   know  nothing  they  will 

arithmetic  and  calculation  and     GEOMETRY    and  astronomy  and  draughts 

the  egyptians  wiser  and     GIVE    them  better  memories  it 

to  reminiscence  and  you     GIVE    your  disciples  not  truth 
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was  a  famous  old     GOD    whose  name  was  theuth 
is  those  days  the     GOD    thamus  was  the  king 
egyptian  thebes  and  the     GOD    himself  is  called  by 
and  dice  but  his     GREAT    discovery  was  the  use 
he  dwelt  in  that"    GREAT    city  of  upper  egypt 
will  be  tiresome  company    HAVING    the  show  of  wisdom 
truth  they  will  be     HEARERS    of  many  things  and 
upper  egypt  which  the    HELLENES    call  egyptian  thebes  and 
thebes  and  the  god     HIMSELF    is  called  by  them 
which  is  called  the     IBIS    is  sacred  to  him 
thamus  replied  o  most     INGENIOUS    theuth  the  parent  or 
them  and  in  this     INSTANCE    you  who  are  the 
of  the  utility  or     INUTILITY    of  his  own  inventions 
theuth  and  showed  his     INVENTIONS    desiring  that  the  other 
inutility  of  his  own     INVENTIONS    to  the  users  of 
and  he  was  the     INVENTOR    of  many  arts  such 
theuth  the  parent  or     INVENTOR    of  an  art  is 
not  always  the  best     JUDGE    of  the  utility  or 
god  thamus  was  the     KING    of  the  whole  country 
omniscient  and  will  generally     KNOW    nothing  they  will  be 
things  and  will  have     LEARNED    nothing  they  will  appear 
create  f orgetf ulness  in  the     LEARNERS'    souls  because  they  will 
own  children  have  been     LED    to  attribute  to  them 
was  the  use  of     LETTERS    now  is  those  days 
when  they  came  to     LETTERS    this  said  theuth  will 
are  the  father  of     LETTERS    from  the  paternal  love 
it  would  take  a     LONG    time  to  repeat  all 
letters  from  the  paternal     LOVE    of  your  own  children 
this  said  theuth  will     MAKE    the  egyptians  wiser  and 
was  the  inventor  of     MANY    arts  such  as  arithmetic 
will  be  hearers  of     MANY    things  and  will  have 
and  give  them  better     MEMORIES    it  is  a  specific 
will  not  use  their     MEMORIES    they  will  trust  to 
specific  both  for  the     MEMORY    and  for  the  wit 
not  an  aid  to     MEMORY    but  to  reminiscence  and 
that  the  other  egyptians     MIGHT    be  allowed  to  have 
wit  thamus  replied  o     MOST    ingenious  theuth  the  parent 
famous  old  god  whose     NAME    was  theuth  the  bird 
the  egyptian  city  of     NAUCRATIS    there  was  a  famous 
and  will  have  learned     NOTHING    they  will  appear  to 
and  will  generally  know     NOTHING    they  will  be  tiresome 
the  wit  thamus  replied     0    most  ingenious  theuth  the 
there  was  a  famous     OLD    god  whose  name  was 
will  appear  to  be     OMNISCIENT    and  will  generally  know 
inventions  desiring  that  the     OTHER    egyptians  might  be  allowed 
of  them  and  censured     OTHERS    as  he  approved  or 
or  inutility  of  his     OWN    inventions  to  the  users 
paternal  love  of  your     OWN    children  have  been  led 
most  ingenious  theuth  the     PARENT    or  inventor  of  an 
of  letters  from  the     PATERNAL    love  of  your  own 
said  to  theuth  in     PRAISE    or  blame  of  the 
their  several  uses  and     PRAISED    some  of  them  and 
attribute  to  them  a     QUALITY    which  they  cannot  have 
of  wisdom  without  the     REALITY 

written  characters  and  not    REMEMBER   of  themselves  the  specific 
to  memory  but  to     REMINISCENCE    and  you  give  your 
a  long  time  to     REPEAT    all  that  thamus  said 
for  the  wit  thamus     REPLIED    o  most  ingenious  theuth 
called  the  ibis  is     SACRED    to  him  and  he 
repeat  all  that  thamus     SAID    to  theuth  in  praise 
came  to  letters  this     SAID    theuth  will  make  the 
truth  but  only  the     SEMBLANCE    of  truth  they  will 
thamus  enquired  about  their     SEVERAL    uses  and  praised  some 
tiresome  company  having  the     SHOW    of  wisdom  without  the 
him  came  theuth  and     SHOWED    his  inventions  desiring  that 
SOCRATES    at  the  egyptian  city 


114 


several  uses  and  praised     SOME    of  them  and  censured 

f orgetf ulness  in  the  learners1     SOULS    because  they  will  not 

memories  it  is  a     SPECIFIC    both  for  the  memory 

remember  of  themselves  the    SPECIFIC   which  you  have  discovered 

of  them  it  would    TAKE   a  long  time  to 

those  days  the  god    THAMUS   was  the  king  of 

he  enumerated  them  and    THAMUS    enquired  about  their  several 

to  repeat  all  that    THAMUS    said  to  theuth  in 

and  for  the  wit     THAMUS    replied  o  most  ingenious 

the  hellenes  call  egyptian     THEBES    and  the  god  himself 

and  thamus  enquired  about     THEIR    several  uses  and  praised 

they  will  not  use     THEIR    memories  they  will  trust 

and  not  remember  of     THEMSELVES    the  specific  which  you 

egyptian  city  of  naucratis     THERE    was  a  famous  old 

god  whose  name  was     THEUTH    the  bird  which  is 

ammon  to  him  came     THEUTH    and  showed  his  inventions 

that  thamus  said  to     THEUTH    in  praise  or  blame 

to  letters  this  said     THEUTH    will  make  the  egyptians 

replied  o  most  ingenious     THEUTH    the  parent  or  inventor 

be  hearers  of  many     THINGS    and  will  have  learned 

of  letters  now  is     THOSE    days  the  god  thamus 

would  take  a  long     TIME    to  repeat  all  that 

nothing  they  will  be     TIRESOME    company  having  the  show 

their  memories  they  will     TRUST    to  the  external  written 

give  your  disciples  not     TRUTH    but  only  the  semblance 

only  the  semblance  of    TRUTH    they  will  be  hearers 

that  great  city  of     UPPER    egypt  which  the  hellenes 

great  discovery  was  the     USE    of  letters  now  is 

because  they  will  not     USE    their  memories  they  will 

own  inventions  to  the     USERS    of  them  and  in 

enquired  about  their  several     USES    and  praised  some  of 

best  judge  of  the     UTILITY    or  inutility  of  his 

or  blame  of  the     VARIOUS    arts  but  when  they 

the  various  arts  but     WHEN    they  came  to  letters 

was  theuth  the  bird     WHICH    is  called  the  ibis 

city  cf  upper  egypt     WHICH    the  hellenes  call  egyptian 

to  them  a  quality     WHICH    they  cannot  have  for 

of  themselves  the  specific     WHICH    you  have  discovered  is 

the  king  of  the    WHOLE    country  of  egypt  and 

a  famous  old  god     WHOSE    name  was  theuth  the 

having  the  show  of     WISDOM    without  the  reality 

will  make  the  egyptians     WISER    and  give  them  better 

memory  and  for  the     WIT    thamus  replied  o  most 

the  show  of  wisdom     WITHOUT    the  reality 

disapproved  of  them  it     WOULD    take  a  long  time 

trust  to  the  external     WRITTEN    characters  and  not  remember 

for  this  discovery  of     YOURS    will  create  f orgetf ulness  in 
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Student  Paper.   Example  of  Poor  Writing 

PROFILE 


Total  of  sentences 
Total  of  words 
Total  of  letters 


62 

2026 

10561 


2026  words  are    plenty  for  valid  statistical  analysis. 


Of  the    2026  words   in  Student  Paper.   Example  of  Poor  Writing  , 
1839  were  matched  to  words  in  the  Stylist  dictionary. 

90  '/.      were  matched. 

This  is  enough  for  statistical  analysis. 
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BREAKDOWN  OF  PERCENT  OF  LETTERS  PER  WORD 
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X 

X 

X 
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X 

3 

4 

5 

6 

7 

8 

9 

10 

11 

12    13    14    15 


Number  of  Letters  in  a  Word 

The  average  number  of  letters  per  word    :   6 . 921887713E+00 

A  typical  nonfiction  texts  distribution 

resembles  a  low  bell-shaped  curve  centered  around  six  letters/word. 

The  length  of  the  words  is  long. 
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Number  of  words 
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BREAKDOWN  OF  NUMBER  OF  WORDS  PER  SENTENCE 
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XXXXX  XX  XXXXXXXX  xxxxx 
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XXXXX  XXX  XXXXXXXXXXXXXX  XXXXXXXXXXXXXXX  XXXXXXXXX 
xxxxx  XXX  XXXXXXXXXXXXXX  XXXXXXXXXXXXXXXXXXXXXXXXX 

xxxxx  xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 

xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 

xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 

xxxxxxxxxxx  xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 

xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 

10        20        30        40        50        60 

Sentence  Number 


XXXXXXXX  XX 


70 


The  average  number  of  words  per  sentence 


3.267741935E+01 


A  typical  modern  texts  sentences  average   between  fifteen  and  twenty  words 

Sentences  are    too  long. 

Number  of  run  ons     13 
Percent  of  run  ons    20 

Run  ons  are  unacceptable 


118 


Number  of  medium  length  sentences     14 
Percent  of  medium  length  sentences    22 


Modulation  is  average 


ETYMOLOGY  OF  WORDS 


Number  of  Latinate  words 
Number  of  Germanic  words 

Etymology  is  mixed 


708 
402 


DIFFICULTY  OF  VOCABULARY 


PostGraduate  difficulty 
Graduate  difficulty 
High  School  difficulty 
Elementary  difficulty 


26 
141 
405 
538 


Percent  of  Postgraduate  difficulty 
Percent  of  Graduate  difficulty 
Percent  of  High  School  difficulty 


1 

7 

22 


Difficulty  is  very  hard 


TANGIBILITY 


Number  of  Tangible  words 
Number  of  Intangible  words 

Tangibility  is   tangibible. 


343 

767 


EMOTIONAL  CONNOTATIONS 


Sublime  connotations 
Pleasant  connotations 
Neutral  connotations 
Unpleasant  connotations 
Horrid  connotations 


7 

166 

861 

75 

1 


Percent  of  sublime  connotations 

Percent  of  pleasant  connotations 

Percent  of  unpleasant  connotations 

Percent  of  horrid  connotations 

Index  of  Emotionality  26 

Emotionality  is  Rich 

Tone  is  Positive 
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VIGOR  OF  WORDS 


Words  of  Extreme  Vigor 
Words  of  Much  Vigor 
Words  of  Some  Vigor 
Words  of  Little  Vigor 


1 

94 

464 

551 


Percent  of  words  of  extreme  vigor 
Percent  of  words  of  much  vigor 
Percent  of  words  of  some  vigor 


Index  of  Vigor 
Vigor  is  lively 


50 


0 

5 

25 


RECOMMENDATION  NUMBER 


You  tend  to  write  run-on  sentences. 
Check  your  longest  sentences  for  run  ons 
Break  them  up  into  units  of  single  ideas 


RECOMMENDATION  NUMBER   2 


Your  average  sentences  are    too  long  for  the  difficulty  of  your  vocabulary 
Use  simpler  words  or  shorter  sentences. 
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Excerpt  from  A  Farewell  to  Arms 


PROFILE 


Total  of  sentences 
Total  of  words 
Total  of  letters 


65 
1349 
5431 


1349  words  are    plenty  for  valid  statistical  analysis. 


Of  the    1349  words   in  Excerpt  from  A  Farewell  to  Arms 
1197  were  matched  to  words  in  the  Stylist  dictionary. 

88  '/.      were  matched. 

This  is  enough  for  statistical  analysis. 
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BREAKDOWN  OF  PERCENT  OF  LETTERS  PER  WORD 
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6 
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Number  of  Letters  in  a  Word 

The  average  number  of  letters  per  word    :   5 . 100817439E+00 

A  typical  fiction  texts  distribution 
resembles  a  tall  bell-shaped  curve  centered  around  five  letters/word, 

The  length  of  the  words  is  medium 
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1 

Nu 
10 
98 
96 
94 
92 
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83 
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84 
82 
80 
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BREAKDOWN  OF  NUMBER  OF  WORDS  PER  SENTENCE 
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XX  XXX 
XXXX  XXXXXXXXXXXXXXXXXX  xxxxxxx  xxxxxxxxxxxx 
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xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 

10        20        30        40        50        60 

Sentence  Number 


70 


The  average  number  of  words  per  sentence 


2.075384615E+01 


A  typical  modern  texts  sentences  average   between  fifteen  and  twenty  words 

Sentences  are  long. 

Number  of  run  ons      3 
Percent  of  run  ons     4 

Run  ons  are  acceptable. 
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Number  of  medium  length  sentences     26 
Percent  of  medium  length  sentences    40 


Modulation  is  average 


ETYMOLOGY  OF  WORDS 


Number  of  Latinate  words 
Number  of  Germanic  words 

Etymology  is  very  native. 


137 
484 


DIFFICULTY  OF  VOCABULARY 


PostGraduate  difficulty 
Graduate  difficulty 
High  School  difficulty 
Elementary  difficulty 


0 

0 

25 

596 


Percent  of  Postgraduate  difficulty 
Percent  of  Graduate  difficulty 
Percent  of  High  School  difficulty 


Difficulty  is  easy. 


TANGIBILITY 


Number  of  Tangible  words 
Number  of  Intangible  words 

Tangibility  is   tangibible. 


179 
442 


EMOTIONAL  CONNOTATIONS 


Sublime  connotations 
Pleasant  connotations 
Neutral  connotations 
Unpleasant  connotations 
Horrid  connotations 


1 

135 

450 

34 

1 


Percent  of  sublime  connotations 

Percent  of  pleasant  connotations 

Percent  of  unpleasant  connotations 

Percent  of  horrid  connotations 

Index  of  Emotionality  26 

Emotionality  is  Rich 

Tone  is  Positive 


0 

11 

2 

0 
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VIGOR  OF  WORDS 


Words  of  Extreme  Vigor 
Words  of  Much  Vigor 
Words  of  Some  Vigor 
Words  of  Little  Vigor 


73 
324 
220 


Percent  of  words  of  extreme  vigor 
Percent  of  words  of  much  vigor 
Percent  of  words  of  some  vigor 


Index  of  Vigor 
Vigor  is  strong 


57 


0 

6 

27 


This  is  a  solid  piece  of  writing  well  within  the  traditions  of  its  genre. 

You  are  as  able  to  understand  the  meaning  of  the  above  characteristics  as  The  St 

ylist. 

Stand  the  course! 
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Computer  Science  Text   1 


PROFILE 


Total  of  sentences 
Total  of  words 
Total  of  letters 


56 
1178 
5868 


1178  words  are    plenty  for  valid  statistical  analysis 


Of  the    1178  words   in  Computer  Science  Text   1 

1026  were  matched  to  words  in  the  Stylist  dictionary. 

87  '/.      were  matched. 

This  is  enough  for  statistical  analysis. 
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BREAKDOWN  OF  PERCENT  OF  LETTERS  PER  WORD 


30 

28 

26 

24 

22 

20 
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14 
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10 

8 

6 

4 
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X 

X 

X 

X 

• 

X 

X 

X 

X 

X 

X 

X 

X 

X 

X 

3 

4 

5 

6 

7 

8 

9 

10 

11 

12 

13    14    15 
Number  of  Letters  in  a  Word 


The  average  number  of  letters  per   word    :   6  .  500000000E+00 

A  typical  nonfiction  texts  distribution 

resembles  a  low  bell-shaped  curve  centered  around  six  letters/word 

The  length  of  the  words  is  medium 


127 


1  BREAKDOWN  OF  NUMBER  OF  WORDS  PER  SENTENCE 

Number  of  words 

100 

98 

96 

94 

92 

90 

8S 

86 

84 

82 

80 

78 

76 

74 

72 

70 

68 

66 

64 

62 

60 

58 

56 

54 

52 

50 

48 

46  X 

44  XX 

42  XX  X 

40  XX  X 

38  XXX  X 

36  X  X        X      X  X 

34  X  X        XX     X  X 

32      X  XX        XX    XX  XX 

30      X  XX        XX    XX      X      X     X 

28      X  XXX        XX    XX      X       X  X  XXX 

26      X  X     X  X   X      XX   XXX   XX     XXX  XXX 

24      XX      X     X  X   X      XX   XXX   X  XX     XXX  XXX 

22      XX      X     X  X   X      XX   XXX   X  XX     XXX  XXX 

20      XX    X      X   XXX  X   XX     XX   XXXX  X  XX     XXX  XXX 

18   X   XX    XX    XX  XXXXX   XX     XX   XXXX  X  XX     XXX  XXX 

16   XXXXXXX  X  X   XXX  XXXXX   XXXX  XXX  XXXXXXXXXXX    XXX  XXX 

14   XXXXXXX  X  X   XXXXXXXXX   XXXX  XXX  XXXXXXXXXXX  X  X  XXX  XXX 

12   XXXXXXX  XXX   XXXXXXXXX   XXXXXXXXXXXXXXXXXXXXXX  X  XXX  XXX 

10  XXXXXXX  XXX  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX  XXXXXXX 
8  XXXXXXX  XXX  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX  XXXXXXX 
6  XXXXXXX  XXX  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 
4  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 
2   XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 

10        20         30         40         50        60        70 

Sentence  Number 

The  average  number  of  words  per  sentence       :   2 . 103571428E+01 


A  typical  modern  texts  sentences  average   between  fifteen  and  twenty  words 

Sentences  are  long. 

Number  of  run  ons      1 
Percent  of  run  ons     1 

Run  ons  are  acceptable. 
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Number  of  medium  length  sentences     28 
Percent  of  medium  length  sentences    50 


Modulation  is  average 


ETYMOLOGY  OF  WORDS 


Number  of  Latinate  words 
Number  of  Germanic  words 

Etymology  is  mixed 


422 
227 


DIFFICULTY  OF  VOCABULARY 


PostGraduate  difficulty 
Graduate  difficulty 
High  School  difficulty 
Elementary  difficulty 


1 

55 

273 

320 


Percent  of  Postgraduate  difficulty 
Percent  of  Graduate  difficulty 
Percent  of  High  School  difficulty 


0 

5 

26 


Difficulty  is  challenging 


TANGIBILITY 


Number  of  Tangible  words     : 
Number  of  Intangible  words   : 

Tangibility  is   very  intangibible . 


106 
543 


EMOTIONAL  CONNOTATIONS 


Sublime  connotations 
Pleasant  connotations 
Neutral  connotations 
Unpleasant  connotations 
Horrid  connotations 


1 

50 

562 

32 

4 


Percent  of  sublime  connotations 

Percent  of  pleasant  connotations 

Percent  of  unpleasant  connotations 

Percent  of  horrid  connotations 

Index  of  Emotionality  14 

Emotionality  is  average 

Tone  is  Positive 
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VIGOR  OF  WORDS 


Words  of  Extreme  Vigor 
Words  of  Much  Vigor 
Words  of  Some  Vigor 
Words  of  Little  Vigor 


7 
34 

301 
307 


Percent  of  words  of  extreme  vigor 
Percent  of  words  of  much  vigor 
Percent  of  words  of  some  vigor 


Index  of  Vigor 
Vigor  is  lively 


<+4 


0 

3 

29 


This  is  a  solid  piece  of  writing  well  within  the  traditions  of  its  genre. 

You  are  as  able  to  understand  the  meaning  of  the  above  characteristics  as  The  St 

ylist . 

Stand  the  course! 
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Excerpt  from  Galapagos  by  Vonnegut 

PROFILE 


Total  of  sentences 
Total  of  words 
Total  of  letters 


22 

542 
2526 


542  words  are  enough  for  valid  statistical  analysis 


Of  the     542  words   in  Excerpt  from  Galapagos  by  Vonnegut 
449  were  matched  to  words  in  the  Stylist  dictionary. 

82  V.      were  matched. 

This  is  enough  for  statistical  analysis. 
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BREAKDOWN  OF  PERCENT  OF  LETTERS  PER  WORD 


30 

28 
26 
24 
22 

20 
18 
16 
14 
12 
10 
S 
6 
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X 
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X 

X 

X 

X 

X 

X 

X 

3 

4 

5 

6 

7 

8 

9 

10 

11    12    13    14    15 


Number  of  Letters  in  a  Word 

The  average  number  of  letters  per  word    ••   5 .  760233918E+00 

A  typical  fiction  texts  distribution 
resembles  a  tall  bell-shaped  curve  centered  around  five  letters/word, 

The  length  of  the  words  is  long. 
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1  BREAKDOWN  OF  NUMBER  OF  WORDS  PER  SENTENCE 

Number  of  words 

100 

98 

96 

94 

92 

90 

88 

86 

84 

82 

80 

78 

76 

74 

72 

70 

63 

66 

64 

62 

60 

53 

56 

54 

52 

50 

48  X 

46  X  X 

44  XX  X 

42  XX  X 

40  X        X  X  X 

38  XX      X  X  X 

36  XX      XX  X  X 

34      X  XX      XX  X  X 

32      X   X        XX      XX  X  X 

30      X   X        XX      XX  X  X 

28      X   X        XX      XX  X  X 

26      X   X        XX      XX  X  X 

24      X   X        XX      XX  X  X 

22      X   X        XX      XX  XXX 

20      XXX      XXX      XX  XXX 

18   XXX   XXX   XXX  XX   XX 

16   XXX   XXX  XXXX  X  XX  X  X 

14   XXX  XXXX  XXXXXXXXXXX  X 

12   XXX  XXXX  XXXXXXXXXXX  X 

10  XXX  XXXX  xxxxxxxxxxxxx 

8  xxxxxxxx  xxxxxxxxxxxxx 

6  xxxxxxxxxxxxxxxxxxxxxx 

4  xxxxxxxxxxxxxxxxxxxxxx 

2  xxxxxxxxxxxxxxxxxxxxxx 

10        20        30        40        50        60        70 

Sentence  Number 

The  average  number  of  words  per  sentence       :   2 . 463636364E+01 


A  typical  modern  texts  sentences  average   between  fifteen  and  twenty  words 

Sentences  are  too  long. 

Number  of  run  ons     2 
Percent  of  run  ons     9 

Run  ons  are  unacceptable 
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Number  of  medium  length  sentences     10 
Percent  of  medium  length  sentences    45 

Modulation  is  average 


ETYMOLOGY  OF  WORDS 


Number  of  Latinate  words 
Number  of  Germanic  words 


86 
184 


Etymology  is  very  native. 


DIFFICULTY  OF  VOCABULARY 


PostGraduate  difficulty 
Graduate  difficulty 
High  School  difficulty 
Elementary  difficulty 


0 

3 

49 

218 


Percent  of  Postgraduate  difficulty 
Percent  of  Graduate  difficulty 
Percent  of  High  School  difficulty 


0 

0 

10 


Difficulty  is  easy. 


TANGIBILITY 


Number  of  Tangible  words 
Number  of  Intangible  words 

Tangibility  is   tangibible. 


76 
194 
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EMOTIONAL  CONNOTATIONS 


Sublime  connotations 
Pleasant  connotations 
Neutral  connotations 
Unpleasant  connotations 
Horrid  connotations 


2 

79 

170 

13 

6 


Percent  of  sublime  connotations 
Percent  of  pleasant  connotations 
Percent  of  unpleasant  connotations 
Percent  of  horrid  connotations 

Index  of  Emotionality  43 

Emotionality  is  Rich 

Tone  is  Positive 


0 

17 

2 

1 


VIGOR  OF  WORDS 


Words  of  Extreme  Vigor 
Words  of  Much  Vigor 
Words  of  Some  Vigor 
Words  of  Little  Vigor 


3 

39 

123 

105 


Percent  of  words  of  extreme  vigor 
Percent  of  words  of  much  vigor 
Percent  of  words  of  some  vigor 


Index  of  Vigor 
Vigor  is  very  strong 


67 


0 

8 

27 


RECOMMENDATION  NUMBER 


You  tend  to  write  run-on  sentences. 
Check  your  longest  sentences  for  run  ons. 
Break  them  up  into  units  of  single  ideas. 
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